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KINASES AND PHOSPHATASES 
TECHNICAL FIELD 

S This iBvention relates to nucleic acid and amino acid sequences of kinases and phosphatases 

and to the use of these sequences in the diagnosis, trealment, and prevention of cardiovascular 
diseases, immune system disorders, neurological disord^ disordm affecting growth and 
development, lipid disorders, cell proliferative disorders, and cancm, and in the assessment of the 
effects of exogenous conqx>unds on the expression of nucleic acid and amino add sequences of 

10 kinases and phosphatases. 

BACKGROUND OF THE INVENTION 

Reversible protein phosphoiylation is (he ubiquitous strategy used to control many of the 
intracellular evmts in eukaryodc cells. It is estimated that more than ten pm:ent of proteins active 

15 in a typical mammalian cell are phosphorylated Kinases catalyze the transfer of higjh-en^cgy 
phosphate groiqps firom adenosine triphosphate (ATP) to targ^ proteins on the hydroxyamino acid 
residues serine, threonine, or tyrosine. Phosphatases, in contrast, remove these phosphate groups. 
Extracellular signals including hormones, neurotransmitters, and growth and differentiation factors 
can activate kmases , v^ch can occur as cell surface receptors or as the activator of the final 

20 effector protein, as weU as other locations along the signal transduction pathway. Cascades of 

kinases occur, as well as kinases sensitive to second messenger molecules. This system allows for 

/■ 

ttie an^lification of weak signals Gow abundance growth fector noolecules, for exanq)le), as well as 
the synthesis of many weak signals into an all-or-nothing response. Phosphatases, then, are 
essratial in determining die extcat of phosphorylation in the cell and, together with kinases, regulate 
25 key cellular processes such as metabolic enzyme activity, proliferaticMi, cell growth and 
differentiation, cell adhesim, and cell cycle progression. 
KINASES 

Kinases conqnise the largest known enzyme superfamily and vary widely ia their target 
molecules. Kinases catalyze the traiisferofMgh energy phosphate gnmps from a phosphate 

30 to a phosphate acceptor. NucleotidesusuaUy serve as the phosphate donor in these reactions, with 
most kinases utilizing ad^osine triphosphate (ATP). The phosphate acceptor can be any of a 
variety of molecules, including nucleosides, nucleotides, lipids, carbohydrates, and proteins. 
Proteins are phosphorylated on hydroxyamino acids. Addition of a phosphate group alters the local 
charge on the acceptor molecule, causing internal conformational changes and potentially 

35 influencing intermolecular contacts. Reversible protem phosphorylation is the primary method for 
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regulating protein activity in eukaiyotic cells. In genml, proteins aie activated by phosphoiylation 
in response to extracellular signals such as honnones, neurotransmitters, and growth and 
differentiation factors. The activated proteins initiate the cell's intracellular response by way of 
intracellular signaling pathways and second messenger luolecules such as cyclic nucleotides, 

5 calcium-calmodulin, mositol, and various mitogens, that regulate protein phosphorylation. 

Kinases are involved in aU aspects of a cell's function, from basic metabolic processes, such 
as glycolysis, to cell-cycle regulation, differentiation, and commmucation with the extracellular 
environment tbrougjh signal transduction cascades, hiappropriate phosphorylation of proteins in 
cells has been linked to changes in ceU cycle progiession and ceUdifferentiatira^ Changesinthe 

10 cell cycle have been linked to induction of apoptosis or cancer. CJianges in cell differentiation have 
been linked to diseases and disorders of the reproductive system, immune system, and skeletal 
muscle. 

There are two classes of protein kinases. One class, protein tyrosine kinases (FTKs), 
phosphoiylates tyrosine residues, and the otiier class, protein serine/threonine kinases (STKs), 

15 phosphorylates serine and threonine residues. Some PTKs and STKs possess structural 

characteristics of both families and have dual specificity for both tyrosine and serine/threonine 
residues. Almost all kmases contain a conserved 250-300 amino acid catalytic domain containing 
specific residues and sequence motifis characteristic of the Idnase fiomily. The protein kinase 
catalytic domain can be further divided into 11 subdomains. N4ennuial subdomains MV fold into a 

20 two-lobedstrocture which Innds and orirats the ATP donor molecde, and 
twolobes. C4enninalsubdomaiiis VI-XI bind the proteiu substrate and transfer 
phosphate firom ATP to the hydroxyl group of a tyrosine, serine, or threonine residue. Bach of the 
11 subdomains contains specific catalytic residues or amino acid motijs characteristic of that 
subdomain. For example, subdomain I contains an 8-amino acid glycin&-rich ATP binding 

25 consoisus motif, subdomain II contains a critical lysine residue required for Tnaxitnal catalytic 
activity, and subdomains VI through IX conqxrise the Mghly conserve PTKsand 
STKs also contain distinct sequence motifs in subdomains VI and VIQ which may confer 
hydroxyamino acid specificity. 

Id addition, kinases may also be classified by additional amino acid sequences, generally 

30 between 5 and 100 residues, which either flank or occur within the kinase domain. These additional 
amino acid sequences regulate kinase activity and detennine substrate specificity. (Reviewed in 
Hardie, G. and S. Hanks (1995) Ihe Protein KmaRft Farts ttnnV^ Vol I, pp. 17-20 Academic Press, 
San Diego CA). In particular, two protein kinase signature sequences have been identified in the 
kinase domain, the first containing an active site lysine residue involved in ATP binding, and the 

35 second containing an aspartate residue inq>ortant for catalytic activity. If a protein analyzed 
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includes the two protein kinase signatures, the probability of that protein being a protein kinase is 
close to 100% (PROSriB: PDOCXWIOO, November 1995). 
Protein Tyrosine Kinases 

Protein tyrosine kinases (PTKs) may be classified as either transmembrane, receptor FTKs 

5 or nontransmembrane, nonreceptor FTKprotdbiis. Transmembrane tyrosine kinases function as 
receptors for most growth factors. Growth factors bind to the receptor tyrosine kinase (RTK), 
which causes the receptor to phosphorylate itself (autqphosphorylation) and specific intracellular 
second messenger proteins. Growth factors (GF) that associate widi receptor FTKs include 
epidermal GF, platelet-derived GP, fibroblast GF, hepatocyte GP, insulin and insulin-like GFs, 

10 nerve GF, vascular CTdothelial GF, and macrophage colony stimulating &ctor. 

Nontransmemhrane, nonreceptor FTKs lack transmembrane regions and, instead, form 
signaling complexes with the cytosoHc domains of plasnia membrane receptors. Receptors that 
function Arough non-receptor FTKs include those for cytokines and hormones (growth hormone 
and prolactin), and antigen-specific leceptors oa T and B lymphocytes. 

15 Many FTKs were first identified ails cmcGgene products in canc^ cells in which PTK 

activation was no longer subject to normal cellular controls. In fisust, about one third of the known 
oncog^es encode FIKs. FurthCTUore, cellular transformation (oncogenesis) is often acconqianied 
by increased tyrosine phosphorylation activity (Charbonneau, H, and NJBL Tonks (1992) Annu. Rev. 
Cell Biol. 8:463-493). Regulation of FTK activity may tfamfore be an important strategy in 

20 c(»itrolling some types of cancer. 
Protein Serinc/Th ^^wiwe Tglwaigftg 

Protein semcAfareonine kinases (STKs) are nontransmeinbranep^ A subclass of 
STKs are known as IffltKs (extracellular signal regulated kinases) or MAPs (ndtpg^-activated 
protem kinases) and are activated after cell stimulation by a variety of hormones and growth factors . 

25 CeUstunulation induces a signaling cascade leading to phosphorylation of MEK^A^ 

kinase) which, in turn, activates ERK via serine and threonine phosphorylation. A varied number of 
proteins repres^t the downstream effectors for the active ERK and inq>licate it in the ccmtrol of cell 
proliferation and differentiation, as well as regulation of the cytoskeleton. Activation of ERK is 
normally transirat, and cells possess dual specificity phosphatases that are responsible for its down- 

30 regulation. Also, numerous studies have shown that elevated ERK activity is associated with some 
cancers. Other STKs include the second messenger dependent protein kinases such as the 
cyclic-AMP dependent protem kinases (PKA), calcium^almodulin (CaM) dependent protein 
kinases, and the mitogen-activated protein kinases (MAP); the cyclin-dependent protein kinases; 
checlq)oint and cell cycle kinases; Numb-associated kinase (Nak); human Fused (hFu); 

35 proliferation-related kinases; 5 -AMP-activated px>tein kinases; and kinases involved in apoptosis. 
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One member of the ERK family of MAP kinases, ERK 7. is a novel 61-kDa protein tbat has 
motif similarities to ERKl and EEIK2, but is not activated by extracellular stimuli as are ERKl and 
EEIK2 nor by the common activators, c-Jun N-tenninal kinase (INK) and p38 kinase. ERK7 
regulates its nuclear localization and inhibition of growth through its C-t»minal tail, not through the 
5 kinase domain as is typical with other MAP kmases (Abe, MX (1999) Mol. Cell. Biol. 
19:1301-1312). 

The second messenger dependent protein kinases primarily mediate the effects of second 
messengers such as cyclic AMP (cAMP), cyclic GMP, inositol triphosphate, phosphatidylinositol, 
3,4,S-triphosphate, cyclic ADP ribose, arachidonic add, diacylglycerol and calcium-calmodulin. 

10 The PKAs are involved m mediating homione-induced cellular responses and are activated by 
cAMP produced within the ceU in response to homione stimulation. cAMP is an intracellular 
mediator of honnoae action in all animal cells diat have been studied. Hdrmone-induced cellular 
responses include thyroid hormone secretian, Cortisol secretion, prp^sterone secretion, glycogen 
breakdown, bone resorption, and regulation of heart rate and force of heart muscle contraction. 

15 PKA is found in all animal cells andisfhoughttoaccountfor^efifects ofcAMP in most of these 
cells. Altered PKA expression is inq>ficated in a variety of disorders and diseases including c^ 
thyroid disorders, diabetes, athmsclerosis, and cardiovascular disease (Isselbacher, KJ. et'aL 
(1994) Harrison*s Principles of Internal MedidncL McGraw-Hill, New Yoric NY. pp. 416-431, 
1887). 

20 The casein kmase I (CKI) g^e family is another subfamily of serineAhreonine protein 

kinases. This continuously e^qmnding group of kmases have been inq>licated in the regulation of 
numerous cvtoDlasmic and nuclear processes, including cell metabolism and DNA replication, and 
repair. ClXIen^mes are present in the membranes, nucleus, cytoplasm and cytos^^ 
eukaryotic cells, and on the mitotic spindles of mammalian cells (Fish, KJ. et al. (1995) J. Biol. 

25 Chem. 270:14875-14883). 

The CKI family members aU have a short an]ino4erminal domain of 9-76 amino acids, a 
highly conserved kinase domain of 284 amino acids, and a variable carfooxyl-tenmnal domain that 
ranges fiom 24 to over 200 amino acids in length (Cegielska« A. et al. (1998) J. BioL C!hem. 
273:1357-1364). The CKI £unily is comprised of highly related proteins, as seen by flie 

30 identificationofisoformsof casein kmase I from a variety of sources. There are at least five 
mammalian isoforms, a, p,Y»5> and e. HshetaL identified CKL^silon from a human placenta 
cDNAlibrary. li is a basic protemof416 ammo adds and is closest to CKI-delta. Throu^ 
recombinant expression, it was detennmed to phosphorylate known CKI substrates and was 
inhibited by the CKI-specific inhibitor CKI-7. The human gene for CKLepsilon was able to rescue 

35 yeast with a slow-growth phenotype caused by deletion of the yeast CKI locus, HRR250 (Fish et al.. 
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Hie Tnammalian ciicadian nuitation tau was found to be a semidominant autosomal allele of 
CKI-epsilon that markedly short^s period lengdi of ciicadian ifaythms in Syrian hamsters. The tau 
locus is encoded by casein kinase I-epsUon, which is also a homolog of the Diosophila circadian 

5 gene double-time. Studies of both the wildlype and tau mutant CKI-epsilon enzyme in^ 

the mutant enzyme has a noticeable reduction in the maximum velocity and autophosphorylation 
state. Further, in vUro, CKI-epsilon is able to interact with mammalian PERIOD proteins, while the 
mutant enzyme is deficient in its atnlity to phosphoiylate PERIOD. Lowrey et al. have proposed 
that CKI-epsilon plays a major role in delaying the negative feedback signal within ib& 

10 tcanscription-translation-based autotegulatoiy loop that conqx>ses the core of the circadian 
mechanism. Therefore the CKI-epsilon enzyme is an ideal target for phannaceutical con^munds 
influmcing circadian ifaythms, jet-lag and sleep, in addition to other physiologic and metabolic 
processes under circadian regulation (Lowrey, PX. et aL (2000) Science 288:483-491). 
Caldum-Calmod iiHi nftpft odent Protp ™ gwiaRfts 

15 Caldum-cahnodulin depends (CaM) kinases are involved in regulation of smooth muscle 

contraction, glycogm breakdown (phosphorylase kinase), and neurotransmission (CaM kinase I and 
CaMkmasell). CaM depradent protein kmases are activated by calinodulm, an intracellular 
caldum receptor, in response to the concentration of free calcium in the cell. Many CaM kinases 
are also activated by phosphorylatioiL Some CaM kinases are also activated by 

20 autophosphorylation (V by oth^ regulatory kinases. CaMkmaselphosphorylatesavariety of 
substrates including the neurotransmittCT-related proteins synspsin I and n, die gene transcription 
regulator, CEUBB, and the cystic fibrosis conductance regulator protdn, CFTR (Haiibabu, B. et aL 
(1995) EMBO J. 14:3679-3686). CaM kinase n also phosphoarylates synapsin at difikrent sites and 
controls tito synthesis of catecholamines in the brain dirough phosphorylation and activation of 

25 tyrosine hydroxylase. CaM kixiasen controls the synthesis of catecholamines and seratonin, 
dirough phosphoiylation/activation of tyrosine hydroxylase and tryptophan hydroxylase, 
respectively (Pujisawa, HL (1990) BioOBssays 12:27-29). The mRNA ^icoding a cafanodulin-binding 
ptotem kinase-like protdn was found to be enriched in mammalian fMebrain. This protein is 
associated with vesicles in bodi axofis and doadrites and acaimnlates largely postnatally. The 

3Q ainino add sequence of this protem is siniilar to CaM^l^ndffltSTKs, and tte 

calmodulin in the presence of cateium (Godbout, M. et al. (1994) J. Neurosd. 14:1-13). 

Homeodomain-intoacting protein kinases (IflPKs) are serine/threonine kmases and novel 
members of die DYRK kinase subfamily (Hofiooann, T.O. et aL (2000) Biochunie 82:1123-1127). 
HIPKs contain a conserved protein Idnase domain separated from a domain that interacts with 

35 homeoprotBins. are nuclear kinases, and HIPK2 is Ughly expressed in iieurcHudti^ 
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YJEL et al. (1998) J. Biol. Chem. 273:25875-25879; Wang, Y. et al. (2001) Biochim. Biophys. Acta 
1518:168-172). HJPKsactascorepiesscmforhoineodoimantra^ This 
coiepiessor activity is seen in posttranslational modifications such as ubiquitination and 
phosphorylation, each of which are important in the regulation of ceUular protein function (Kim, 

5 YJLetal. (1999) Roc. Natl. Acad. Sd. USA 96:12350-12355). 

The human h-warts protein, a homolQg of Drosophila warts tumor suppressor gene, maps to 
chromosome 6q24-25.1. has a sedne/threonineldnase domain and is localized to centrosomesm 
interphase cells. It is mvolved in mitosis and functions as a componmt of the mitotic apparatus 
(Nishiyama, Y. et al. (1999) FEBS Lett 459:159-165). 

10 CMdum-O lmft'i'ifi" Dependent Ptotcin K inases 

Caldum-calmodulin depmdent (CaM) kinases are involved in regulation of smooth muscle 
contraction, glycogra breakdown (phosphorylase kinase^ and neurotransmission (CaM kinase I and 
CaMkinasen). dependent protein kinases are activated by calmodulin, an intra^ 
caldumrecq>tor,inreqKnisetotbeconcentratiraoffieecalduminthe^^ Many CaM kinases 

15 are also activated by phosphorylation. Some CaM kinases are also activated by 

autophosphorylation or by other regulatory Idnases. - CaM kinase I phosphorylates a variety of 
substrates including the neurotransmitt^-related proteins synapsin I and II, the gene transcription 
regulator, CREB, and the cystic fibrosis conductance regulator protein, CPTR (Haribabu, B. et al. 
(1995) EMBO J. 14:3679-3686). CaM kinase II also phosphorylates synapsin at diffimnt sites and 

20 controls the synthesis of catecholammes in the brain througji phosphorylation 

tyrome hydro^lase. CaM kinase II controls the synthesis of catecholamines and seratonin, 
through phosphorylation/activation of tyrosine hydroxylase and tryptophan hydroxylase, 
respectively (PUjisawa, H. (1990) BioEssays 12:27-29). The mRNA encodmg a calmodulin-binding 
protein Idnase-like protein was found to be enriched in manmialianfoiebra^ TMs protein is 

25 associated with vesicles in both axons and dendrites and accumnlates largely pos^ The 
amino acid sequence of this protein is similar to CaM-depeodent STKs, and the protein binds 
calmodulin in the presence of caldum (Godbout, M. et aL (1994) J. Neurosd. 14: 1-13). 
ftfltogea-Activatcd Protdn 

The mitpgen-activated protdn kniases (MAP), which mediate signal transduction ftom the 

30 ceUsur&ce to the nucleus via phosphorylation cascades, are another STK family 

intracellular signaling pathways. Sevoal subgroups have heca identified, and each manifests 
differoit substrate spedfidties and responds to distinct extracellular stimuli (Egan, S.R and RA. 
Weinberg (1993) Nature 365:781-783). There are three kinase modules conqnising the MAP kinase 
cascade: MAPK (MAP), MAPK kinase (MAP2K, MAPKK, or MKK), and MKK kinase (MAP3K, 

35 MAPKKK, OR MEKK) (Wang,X.S. et al (1998) Biochem. Biophys. Res. Commun. 253:33-37). 
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The extxacellular-iegulated kmase pathway is activated by growth factors and nntog^ns, for 
exaiiq>le, epidermal growth factor (EGF), ultraviolet light, hyperosmolar mediumi, heat shock, or 
endotoxic lipopolysaccharide (LPS). Hie closely related though distinct parallel pathways, the c- 
Jun N-temunal kinase (INK), or stress-activated kinase (SAPK) pathway, and the p38 kinase 
5 pathway are activated by stress stunuli and proinflammatory cytoldnes sudi as tmnor necrosis factor 
(TNF) and interleukin-l (IL-1). Altered MAP kinase expression is ini{>licated in a variety of disease 
conditions including canc^, inflammation, immune disordm, and disorders affecting growth and 
develq>inent. MAP kinase signaling pathways are present in mammalian cells as well as in yeast 
The family of p2 1 -activated protem kmases (PAKs) appear to be preset in all organisms 

10 that havr r4r47-ll^fP CWPasPA Tn maTmnaliaTi rftllg, PAKs hnvft heftn implicalBd in the activation of 

mitogen-activated prc^in kmase cascades. PAK functions also include the dissolution of 
cytoskeletal stress fibers and reorganization of focal complexes CManser, K et al. (1997) Mol. Cell 
Biol.l7(3):1129-1143). 
CYdin-Pepcpdmt Protein Kinases 

15 The cy clin-depmdent protem kinases (CDKs) are STKs that control the progressim of cells 

through the cell cycle. The entry and exit of a cell from mtosis are regulated by the synthesis and 
destructicn of a fsarSly of activating proteins called cyclins. Cyclins are small regulatory proteins 
that bind to and activate CDKs, which then phosphorylate and activate selected proteins involved in 
the mitotic process. CDKs are unique in that they require multiple inputs to become activated, hi 

20 addition to cyclin binding, CDK activation requkes the phosphorylation of a specific threra 
residue and tiie dephosphorylation of a specific tyrosine residue on the CDK. 

Another family of STKs associated with the cell cycle are the NIMA (nev^ in mitosis)- 
related kinases (Neks). Both CDKs and Neks are involved in duplication, maturaticm, and 
sqmration of the microtubule (xganizmg center, the cenlrosome, in animal cells (Pry, A.M. et al. 

25 (1998) EMBO J. 17:470-481). 

Checkpoint ^vA Cell Cyde Kinases 

In die process of cell division, the order and timing of cell cycle transitions are und^ 
control of cell cycle checkpcnnts, which ensure that critical evmts such as DNA replication and 
chromosome segregation are carried out with precision. IfDNA is damaged, e.g. by radiation, a 

30 checkpoint pathway is activated that arrests the ceU cycle to provide time for repaid Ifthedamage 
is extfflisive, apoptosis is induced, fii tiie absence of such checlqMMnts, the damaged DNA is 
ifih^'tM by aberrant cells which may cause proliferative disorders such as cancer. Protein kinases 
play an important role in tins process. F6r example, a specific kinase, checkpoint kinase 1 (Chkl), 
has been idratified in yeast and mammals, and is activated by DNA damage in yeast Activation of 

35 Odd leads to tiie arrest of tiie cell at the G2/M transition (Sanchez, Y. et al. (1997) Science 
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277:1497-1501). Specifically, Chkl phosphorylates the cell division cycle phosphatase CDC2S, 
inhibiting its normal function which is to dephosphorylate and activate the cyclin-dependent kinase 
Cdc2. Cdc2 activation controls the entiy of cells into nntosis (Peng» C.-Y. et al. (1997) Science 
277:1501-1505). Thus, activation of CMlprevmts the danmgedceU from entering n^^ A 

S deficiency in a checkpoint kinase, such as Chkl , may also contribute to cancer by failure to arrest 
cells with damaged DNA at other checkpoints such as G2/M. 
Proliferation-Related Kinases 

Proliferation-related kinase is a serum/cytokme inducible STK that is involved in regulation 
of the cell cycle and cell proliferation in human megakarocytic cells (U, B. et al. (1996) J. Biol. 

10 CSiem. 271:19402-19408). Ptoliferation-ielated kinase is related to die polo (derived from 
Drosophila polo g^e) family of STKs inq>licated in cell division. Proliferation-related kinase is 
downregulated in lung tumor tissue and may be a proteMmcogene whose deregulated expression in 
normal tissue leads to oncog^c transformation. 
S'^AMP-activated protein fc inafig 

15 A ligand-activated STK protem kinase is 5 -AMP-activated protein kinase (AMPK) (Gao, 

G. et al. (1996) J. Biol Gbem. 271:8675-8681). Mammalian AMPK is a regulator of fetty acid and 
sterol synti^is through phosphorylation of the enzymes acetyl-CoA carboxylase and 
hydroxymethylglutaryl-CoA reductase and mediates resprases of these patiiways to cellular stresses 
such as heat shock and depletion of glucose and ATP. AMPK is a heteiotrimeric complex 

20 comprised of a catdytic alpha subunit and two non-catalytic beta and gamma subunits that ^ 
believed to regulate tiie activity of the alpha subunit Subunits of AMPK have a mudi wider 
distribution in non-lipog^c tissues such as brain, heart, splem, and lung than e^q)^^ This 
distribution suggests that its role may cxteod beyond regulation of lipid metabolism alone. 
The RET (reairanged during transfection) proto^cog^e racodes a tyrosine kinase 

25 recq>torinvdvedinbothinultiple«idocrinenecpla8iatype2,aninheri 

IBrschspnmg disease, a developmental defect of enteric neurons. RET and its functional ligand, 
glial cell line-derived neurotrc^hic factor, play key roles in the development of the human enteric 
nervous system (Pachnis, V. et al. (1998) Am. J. Physiol. 275:0183-0186). 
Kiiiftfigfi i n Apoptosis 

30 Apqptosis is a highly regulated signaling pathway leading to cell death that plays a crucial 

role in tissue development and homeostasis. Deregulation of this process is associated with the 
patiiogenesis of a number of diseases including autoimmune diseases, neurodegenerative disorders, 
and cancer. Various STKs play key roles in this process. 23P kinase is an STK containing a 
C-terminal leucine zipper domain in addition to its N-terminal protein kinase domain. This 

35 r-tftrmiTtal rinmain appears fp wf^mt^- ^nnlfy^i1Tlft^^7flHnn and activation of the kinase as well as 
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mtffl^tioBS with transcription factors such as activating transcription factor, ATP4, a memb^ of the 
cyclic-AMP responsive element binding protein (ATP/CREB) family of transcriptional factors 
(Sanjo, EL et al. (1998) J. Biol. Ghem. 273:29066-29071). DRAKl and DRAK2 are STKs that 
share homology with the death-associated protein kinases (DAP kinases), known to function in 

S interferon-Y induced apoptosis (Sanjo et al.. supra) . Like Z3P kinase, DAP kinases contain a 
C-terminal protein-protein interaction domain, in the form of ankyxin repeats, in addition to the N- 
terminal kinase domain. 23P, DAP, and DRAK kinases induce morphological changes associated 
. with apoptnsis when transfected into NIH3T3 cells fSanjo et al., supra) . However, deletion of either 
the N-terminal Idnase catalytic domain or the C-terminal domain of these protdns abolishes 

10 apoptosis activity, indicating that in additioa to the kinase activity, activity in the C-4enninal domain 
is also necessary for apoptosis, possibly as an interacting domain with a regulator or a specific 
substrate. 

RICK is another STK recentty identified as mediating a specific apoptotic pathway 
involving tiie death recq>tor, CD9S (Inohara,'N. et al. (1998) J. Biol. Chem. 273:12296-12300). 

15 CZD9S is a member of the tumor necrosis factcxr receptor superfamilym^ 

regulation and homeostasis of tiie immune system (Nagata, S. (1997) Cell 88:355-365). The CD95 
rec^tor signaling pathway involves recruitment of various intracellular molecules to a receptor 
complex following ligand binding. This process includes recruitment of the cysteine protease 
caspase-8 which, in turn, activates a caspase cascade leadmg to cell death. RICKiscQnq;K>sedof an 

20 N-tenninal kinase catalytic domain and a C4mninal**caspaseH:ecnn 

with caspase-like domains, indicating tiiat SICK plays a role in the recruit^^ This 
mterpretation is siqypogrtedby the &ct that the ejqxression of RICK in human 293T cells promotes 
activation of caspase-8 and potentiates the induction of apoptosis by various proteins involved in the 
CD95 apoptosis pathway (biohara et al., supra) * 

25 Mitochondrial Protein Kinajees 

A novel class of eukaiyotic kinases, related by sequ^ice to prokaryotic histidine protein 
kinases, are the mitochondrial protein kinases (MPKs) which seem to have no sequence similarity 
with other eukaryotic protein kinases. These protein Idnases are located exclusively hi the 
mitochondrial matrix space and may have evolved firom g^nes originally present in respiration- 

30 dependent bacteria which wmendocytosed by primitive eukaryotic cells. MPKs are responsible 
for phosphofylation and inactivation of the branched-chain alpha-ketoacid dehydrogenase and 
pyruvate dehydrogenase conqplexes (Harris, RA. et al. (1995) Adv. Bozyme ReguL 34:147-162). 
Kve MPKs have been identified. Four members correspond to pyruvate dehydrogenase Idnase 
isozymes, regulating the activity of die pyruvate ddbydrogenase conq>lex, which is an important 

35 regdatoryraizyrne at the mterfiace between glycolysis and the d Tlie fifth member 
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conesponds to a branched-chain alpha-ketoadd dehydiogenase kinase, inqioitant in the regulation 

« 

of the pathway for the disposal of branched-chain amino acids. (Hams, R. A. et al. (1997) Adv. 
Enzyme Regul. 37:271-293). Both starvation and the diabetic state are known to result in a great 
increase in the activity of the pyruvate dehydrogenase kinase in the liver, heart and muscle of tte 
5 rat This inoiease contributes m both disease states to the phosphorylation and inw^ 
pyruvate ddiydrogenase conq)lex and conservation of pyruvate and lactate for gluconeogenesis 
(Harris (1995) supra) . 

lONASR S WITH NON-PROTEIN SUBSTRATES 

10 Lipid and Inoritol Irin nsfig 

Lipid kinases phosphorylate hydroxy! residues on lipid head groups. A family of kinases 
involved in phosphorylation of phosphatidylinositol (PI) has been described, each member 
phosphorylating a specific carbon on the inositol ring (Leevers, S. J. et al. (1999) Curr. Opin. CelL 
Biol. 1 1:219-225). The phosphorylation of phosphatidylinositol is involved in activation of tte 

15 protein kinase C signaling pathway. The inositol phospholipids (phosphoinositides) intracellular 

• ftj gnalmg pathway begins with binding of a signaling molecule to a G-protein linked receptor in the 
plasma membrane. This leads to the phosphorylation of phosphatidylinositol (PI) residues on the 
inner side of the plasma membrane by inositol kinases, thus converting PI residues to the 

• biphosphate state (PIPj). PIP2 is then cleaved into inositol triphosphate (IP3) and diacylglyceroL 
20 These two products act as mediators forsq>axate signaling pathways. Cellular respcmses that axe 

mediated by these pathways are glycogra breakdown in the liver in response to vasopressin, smooth 
muscle contraction in response to ac^lcholine, and tiuromhin-induced platelet aggregation. 

PI 3-kinase (PBK), which phosphorylates the D3 position of PI and its d^vatives, has a 
central role in growth factor signal cj^scades involved in cell growth, differratiaticm, and 

25 metaboUsm. PBK is a heteiodinierccmsisting of an adapter subunit and a catalytic subunit The 
adapter subunit acts as a scaffolding protein, interacting with specific tyrosine-phosphorylated 
proteins, lipid moieties, and oth^ cytosolic factors. When the adapter subunit binds tyrosine 
phosphorylated targets, such as the insulin responsive substrate (IRS>1, the catalytic subunit is 
activated and converts PI (4,5) hisphosphate (PIP2) to PI (3,4,5) P3 (PIP3). PIP3 then activates a 

30 number of other proteins, including PKA, protein kinase B (PKB), protein kinase C (PKQ, 

glycogen synthase kinase (GSK>-3, and p70 ribosomal s6 kinase. PBK also interacts directiy with 
the cytosfceletal organizing proteins, Rac, iho, and cdc42 (Shepherd, P.R. et al. (1998) Biochem. J. 
333:471-490). Animal models for diabetes, such as obese zndfat mice, have altered PBK adapter 
subunit levels. Specific mutations in the adapter subunit have also been found in an insulin-resistant 

35 Danish population, suggesting a role for PBK in tvpe-2 diabetes (Shepard, supra) . 
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An o%sanpl& of lipid kinase phosphorylation activity is the phosphorylation of 
D-erythio-sphingosine to the sphingolipid metabolite^ sphingosine-l-phosphate (SPP). SPP has 
emerged as a novel lipid second-messeng^ with both extracellular and intracellular actions 
(Kohama, T. et al. (1998) J. Biol. Chem. 273:23722-23728). Extracellularly, SPP is a ligand for the 

5 Cr-piotein coupled rec^tor EDG-1 (endotfaelial-derived, G-protein coupled receptor). 

hitracellularly» SPP regulates cell growth, survival, mc^ty, and cytoskeletal changes. SPP levels 
are regulated by sphingosine kinases that specifically phosphoiylate D-erythro-sphingosine to SPP. 
The inqxMtance of sphingosine kinase in cell signaling is indicated by the fact that various stimuli, 
including platelet^lerived growth factor (PDGF), n^e growth factor, and activation of protein 

10 IdiiaseQincreaseceUularlevelsof SPP by activation of sphingosine kinase, and the £e^ 
competitive inhibitors of the en^n^ selectively inhibit cell jnoliferation induced by PDOF 
(Kohama et al., supra) , 

PKC is also activated by diacylglycerol (DAG). Phocbol esters (PB) are analogs of DAG 
and tumor promoters that cause a variety of physiological changes when administered to cells and 

15 tissues. PB and DAG bind to the N-4ermiiial region of PKC This region contains one or more 
copies of a cysteinenich domain about SO amino-acid residues long and essential for 
DAG/P&-binding. Diacylglycerol kinase (DGK), die enzyme tiiat converts DAG into phosphatidate, 
contains two copies of the DAG/PE4»nding domain m its N-tenninal section (Azzi, A. et ai. (1992) 
Eur. J. Biochem. 208:547-557). 

20 An exanqple of lipid kinase phosphoiylatian activity is*the phosphorylation of 

D-erythro-sphingosine to the sphingolipid metabolite, sphingosine-li[>hosphale (SPP). SPP has 
emerged as a novel lipid seccmd-messenger with both extracellular and intracellular actions 
(Kohama. T. et al. (1998) J. Biol. Chem. 273:23722-23728). Extracellularly. SPP is a ligand for flie 
Gprotein coiq)led receptor EDG-1 (endothelial-derived, G-protein coupled receptor). 

25 IhtraceUularly, SPP regulates ceU growth, survival, rnotiUty, and cytoskelet^ SPP levels 

are regulated by sphingosine kinases that specifically phosphorylate D-erythro-sphingosine to SPP. 
The iiiq;)ortance of sphingosine kinase in cell signaling is indicated by the fact that various stimuli, 
including platelet-derived growth factor (PDGF), nerve growth factor, and activation of protein 
kinase C, increase cellular levels of SPP by activation of sphingosine kinase, and the fact that 

30 conq)etitive mhibitors of the enzyme selectively inhibit cell proliferation induced by PDGF 
(Kohama et al. supra) . 
Purine Nucleotide Kina^ 

The purine nucleotide kinases, adenylate kinase (ATP: AMP phosphotransferase, or AdK) 
and guanylate kinase (ATP:GMP phosphotransferase, or GuK) play a key role in nucleotide 

35 metabolism and are crucial to the synthesis and regulation of cellular levels of ATP and GTP, 
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respectively. These two molecules are precursors in DNA and RNA synthesis in growing cells and 
provide the primary source of biochemical energy in cells (ATP), and signal transduction pathways 
(OTP). InUbition of various steps in the synthesis of these two rm>lecu^^ 
many antiproliferative drugs for cancer and antiviral therapy (Pillweio, K. et al. (1990) Cancer Res. 
5 50:1576-1579). 

AdK is found in almost all cell types and is especially abundant in cells having high rates of 
ATP synthesis and utilization such as skeletal muscle. In these cells AdK is physically associated 
with mitochondria and myofibrils, the subcellular structures that are involved in raergy production 
and utilizaticm, respectively. Recent studies have demonstrated a major fimction for AdK in 

10 transferring Mgh energy pho5ph(»yls from metaboUc processes generating ATP t^ 

components consummg ATP (Zelezoikar, RJ. et al. (1995) J. Biol. Oiem. 270:7311-7319). Thus 
AdK may have a pivotal role in maintaining energy production in cells, particularly tiiose having a 
hi£^ rate of growth or metabolism such as cancer cells, and may provide a target for siq>pression of 
its activity in order to treat certain cancers. Alt^natively, reduced AdK activity may be a source of 

15 various inetaboUc,muscle^ergydis(mler5 that can result in canliac or respi 
be treatable by mcieasing AdK activity. 

OuK, m addition to providing a key stq> in the synthesis of GTP for RNA and DNA 
synthesis, also fulfiUs an essential function in signal transduction pathways of cells tiirough the 
regulation of GDP and GTP. Specifically, GTP bmding to membrane associated G protdns 

20 inediates the activation of ceU receptors, subsequmtintraceUularactivaticm of adenyl eye 
production of the second messenger, cyclic AMP. GDP bmding to G proteins inhibits these 
processes. GDP and GTP levels also control the activity of certaia oncogenic proteins s^^ 
Imown to be involved in control of cell proliferation and oncogenesis (Bos, J.L. (1989) Cancer Res. 
49:4682-4689). Ifig|i ratios of GTP:GDP caused by suppressim of GuK cause activation of p2V^ 

25 and promote oncogenesis. Increasing GuK activity to increase levels of CK>P and reduce the 
GTP:GDP ratio may provide a therapeutic strategy to reverse oncogenesis. 

GuK is an inqportant enzyme in the phosphorylation and activation of certain antiviral 
drugs useful in the treatment of herpes virus infections. These drugs include the guanine homologs 
acyclovir and buciclovir (Miller, W.a and RX. Miller (1980) J. Biol. Chem. 255:7204-7207; 

30 Stenberg, K. et al. (1986) J. Biol. Oiem. 261:2134-2139). hicreasing GuK activity in infected cells 
may provide a therapeutic strategy for augmmting the effectiveness of these drugs and possibly for 
reducing the necessary dosages of the drugs. 
Pyrimidinc Kinases 

The pyrimidine kinases are deoxycytidine kinase and thymidine Idnase 1 and 2. 

35 Deoxycytidine Idnase is located in the nucleus, and thymidine kinase 1 and 2 are found in the 
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cytosoi (Johansson, M. et al. (1997) Proc. Nail. Acad. Sci. USA 94: 1 1941-1 1945). Phosphoiylation 
of deoxyribonucleosides by pyrimidine kinases provides an alternative pathway for de novo 
synthesis of DNA precursors. The role of pyrimidine kinases, like purine kinases, in 
phosphorylation is critical to the activation of several chemotherapeutically important nucleoside 
5 analogues (Am^ RS. and S. ]&iksson (199S) Pharmacol. Ther. 67: lSS-186). 

PHOSPHATASES 

Protein phosphatases are ^nerally charactoized as eith^ serine/threonine- or tyrosine- 
specific based on their preferred phospho-ammo add substrate. However, some phosphatases 

10 (DSPs, for dual specificity phosphatases) can act on phosphorylated tyrosine, serine, or threonine 
residues. Hie protein serine/threonine ph<^hatases (PSPs) are inq>ortant regulators of many 
cAMP-mediated honncnie responses in cells. Protein ^rosine phosphatases (PTPs) play a 
significant role in cell cycle and cell signaling processes. Another£aimly of phosphatases is the 
add phosphatase or histidine acid phosphatase (HAP) ftmily whose menDbers hydrolyzB phosphate 

15 esters at acidic pH conditions. 

PSPs are found in the cytosoi, nucleus, and nutochmdria and in association with 
cytoskeletal and membranous structures in inost tissues, especiaUy the brai^ Some PSPs require 
divalrat cations, such as Ca^ or Nbi^, for activity. PSPs play important roles m glycogen 
metabolism, muscle CQntraction, protein synthesis, T cell function, neuronal activity, oocyte 

20 maturation, and hepatic metabolism (reviewed in Cohen, P. (1989) Annu. Rev. Biochem. 58:453- 
508). PSPs can be sqraiated into two classes. The PPP class mcludesPPl,PP2A, 
PP2B/calcmeurin, PP4, PP5, PP6, and PP7. Members of this class are conq)osed of a homologous 
catalytic subunit bearing a very U^y consoved signature sequence, coi^led with one or more 
regulatcxy subunits (PROSIFE PDOOWl IS). Further interactions with scaffold and anchoring 

25 molecules detesinine the intraceUular localization of PSPs and substrate spec^ The PPM class 
consists of several closely related isoforms of PP2C and is evolutionarily unrelated to the PPP class. 

PPl dephosphorylates many of die proteins jdiosphorylated by cyclic AMP-dqp^adent 
protein kinase (E^KA) and is an inqiortant regulator of many cAMP-mediated hormone responses in 
cells. A nuniber of isoforms have been identified, with the alpha and beta forms being produced by 

30 alternative splicing of the same gene. Both ubiquitous and tissue-specific targeting protems for PPl 
have been identified, hi the brain, inhibitim of PPl activity by the dopamine and adenosine 3',5 - 
monophosphateHregulated phosphoprotein of 32kE>a (DARPP-32) is necessary fen: normal dopamine 
response in neostriatal neurons (reviewed in Price, NB. and M.C. Mumby (1999) Curr. Opin. 
NeuiobioL 9:336-342). PPl, along with PP2A, has been shown to linait motility in microvascular 

35 endothelial cells, suggesting a role for PSPs in the inhibition of angiogenesis (Gabel, S. et al. (1999) 
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Otolaiyngol. Head Neck Suxg.l21:463^68). 

PP2A is the main serine/thieonine phosphatase. The core PP2A ^izyme consists of a smgle 
36 kDa catalytic subunit (C) associated with a 65 kDa scaffold subunit (A), whose role is to lecniit 
additional regulatory subunits (B). Three gene families ^coding B subimits are known (PRSS, 

5 FR61» and PR72), each of which contain multiple isofonns, and additional families may exist 
(Nfillwaid. T.A et al. (1999) Trends BioscL 24: 186-191). These *B-type" subunits are cell type- and 
tissue-spebific and detennine the substrate specificity, enzymatic activity, and subcellular 
localization of the holo^DLzyme. IlieFRSSfamUy is highly cons^ed and bears a conserved motif 
(PROSITE PDOC0078S). PRSS mcreases PP2A activity toward mitpgen-activated protein kinase 

10 (MAPK)andMAPKkmase(MEK). PP2A dephosphorylates the MAPK active site, mhibiting the 
cell's entry into mitosis. Several proteins can compete with PRSS for FP2A core esDzywR binding, 
including the CKH kinase catalytic subunit, polyomavirus middle and small T antigens, and SV40 
small t antigqa. Viruses may use this mechanism to commandeer PP2A and stimulate progression 
of the ceU throu^ tfie cell cycle (PaUas, D.C et al. (1992) J. Virol. 66:886-893). Altered MAP 

15 kinaseexpiesdonisalsoinaplicatedinavariety of disease conditions including cancer, 

inflammation, immune disorders, and disorders affecting growth and development PP2A, in fact, 
can dephosphorylate and modulate the activities of more than 30 protein kinases i^.^^ * 
evidence suggests that the same is true in vivo for such kinases as PKB, PKC, the calmodulin- 
dependrat kinases, ERK family MAP kinases, cyclin-dep^dent kmases, and fbe LdB kinases 

20 (reviewed in Millward et aL. supraV PP2A is itself a substrate for CXI and kinases, and can 
be stimulated by polycationic macromolecules. A PP2A-like phosphatase is necessary to maintam 
the Gl phase destructicm of mammalian cyclms A and B (Bastians, R et al. (1999) MoL Biol. Cell 
10:3927-3941). PP2A is a major activity in the brain and is unplicated in regulating neurofilament 
stability and normal neural function, particularly the phosphorylation of the microtubul&-a5sociated 

25 proteintau. Hypeiphosphoiylation of tau has been proposed to lead to the neuronal degene^ 
se^ in Alzheimer's disease (reviewed in Price and Mumby, sunraX 

EP2B , or caldneurin, is a Ca^-activated dimeric phosphatase and is particularly abundant in 
the brain. Itconsistsof catalytic and regulatory subunits, and is activated by the binding of the 
calcium/calmodulin conqilex. Calcineurin is the target of the immunosiqipressant drugs 

30 cyclosporineandFE^06. Along with other ceUular factors, these drugs interact with calcineurin 
and inhibit phosphatase activity. In T cells, this blocks the calcium dependent activation of the NF- 
AT family of transcription factors, leading to immunosi^ppression. This family is widely distributed, 
and it is likely that caldneurin regulates gpne expression in other tissues as well. In neurons, 
calcineurin modulates functions which range fix)m the inhibition of neurotransmitter release to 

35 desensitization of postsynaptic NMDA-iQcept{»r coupled cddum channels to long ter^ 
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(reviewed in Price and Mumby, suQii). 

Oth^ membeis of the EPP class have lecenfly been identified (Cohen, P.T. (1997) Tiends 
Biochem. Sci. 22:245-251). One of them, PPS, contains regulatory domains with tetratricopeptide 
repeats. It can be activated by polyunsaturated fatty acids and anionic phospholipids in vitro and 
5 appears to be involved in a number of signaling pathways, including those controlled by atrial 
natriuretic peptide or steroid hormones (reviewed in Andreeva, A.V. and M.A. Kutozov (1999) Cell 
Signal. 11:555-562). 

PP2C is a ^2ldDa monomer.widi broad substrate specificity and is dq)endent on divalent 
cations (mainly Mn^ orMg^) for its activity. PP2C proteins share a conserved N-teiminal region 

10 with an invariant IXMn&otif^wMch contains an aspartate residue involved!^ 

(PROSITE PD(XX)0792). Targeting proteins and mechaniaaos regulating PF2C activity have not 
be^ identified. PP2C has been shown to inMbit the stiess-^responsive p38 and Jun kmase (J>^ 
pathways (Takekawa, M. et al. (1998) EMBO J. 17:47444752). 

In contrast to PSPs, tyrosine-specific phosphatases (PIPs) are ^erally monomeric proteins 

15 of very divttse size (fiom20]d>a to greats than l(X)ldDa) and stnicture that fimctioni^^ 
transduction of signals across the plasma membrane. PIPs are categorized as either soluble 
phosphatases or traiisii]einbranerecq>torpn)teins that contain a pho^ AllPTPs 
share a conserved catalytic domain of about 300 ammo acids which contains the active site. The 
active site consensus sequence includes a cystdne residue which executes a nuclepphilic attack on 

20 the phosphate moiety during catalysis (Neel, E.G. and N.K. Tonks (1997) Curr. Opin. Cell Biol. 
9: 193-204). Receptor FTPs are made up of an N4erminal extracellular domain of variable length, a 
transmembrane re^on, and a cytoplasmic regim that generally contains two cagits of the catalytic 
domain. Although only tiie first copy seems to have enzyriutic activity, the second copy 
afiects the substrate specificity of the first The extracellular domains of some receptor PTPs 

25 contain fibronectin^ike repeats, iimimoglobulin-likedomai]^ 

motif likely to have an adhesive functicHi), or carbonic anhydrase-like domains (PROSITE PDOC 

00323). This wide variety of stnicturaliiiotife accounts for the diversity in size a^ 

PTPs. 

FTPs play unportant roles in biological processes such as cell adhesion, lymphocyte 
30 activation, and cell proliferation. FTPs and k are involved m cell-cell contacts, perhaps regulating 
cadherin/catenin function. A number of PTPs affect cell spreading, focal adhesions, and cell 
motility, most of them via the integrin/tyrosine kinase signaling pathway (reviewed in Neel and 
Tonks, supra) . CD45 phosphatases regulate signal transduction and lynq)hocyte activation 
(Ledbetter, J. A. et al. (1988) Ptoc. Nail. Acad. Sci. USA 85:8628-8632). Soluble PTPs containing 
35 Src-homology-2 domains have been id^tified (SHPs), suggesting that these molecules might 



15 



wo 02/46384 



PCTAJSOl/47431 



interact with receptor Qrosine Idnases. SHP-1 regulates cytokine receptor signaling by controlling 
the Janus family FTKs in hematopoietic cells, as weU as signaling by the T-cell receptor and c-Kit 
(reviewed in Neel and Tonks, supra) . M-phase inducer phosphatase plays a key role in the 
induction of mitosis by dephosphorylatmg and activating the FTK CDC2, leading to cell division 

5 (Sadhu, K. et al. (1990) Proc. Natl. Acad. ScL USA 87:5139-5143). In addition, the genes encoding 
at least eight PTPs have been mapped to chromosomal regions that are translocated or rearranged in 
various neoplastic conditions, including lynqihoma, small ceU lung carcinoma, leukemia, 
. adenocarcinoma, and neuroblastoma (reviewed in Oiarbonneau, H. and N.EL Tonks (1992) Annu. 
Rev. Cell Biol. 8:463-493). The FTP enzyme active site comprises the consensus sequence of the 

10 MTMI gene &mily. The MTMl gene is responsible for X-linked recessive myotubular myopathy, 
a congenital muscle disorder that has been linked to Xq28 (Kioschis, P. et al., (1998) Genomics 
54:256-266). Many VTKs are oicoded by oncogenes, and it is well known that onco^nesis is often 
accorqpanied by increased tyrosine phosphorylation activity. It is therefore possible diat PTPs may 
serve to prev^ or reverse cell transformation and the growth of various cancers by controlling the 

15 levels of tyrosine phosphorylation in cells. This is supported by studies showing that 

overexpressicn of FTP can suppress transformation in cells and that specific inhibition of FTP can 
enhance cell transformation (Charbonneau and Tonks, supra) . 

Dual specificity phosphatases (DSPs) are structurally more similar to the PTPs than the 
PSflPs. DSPs bear an extended PTP active site motif with an additiorial 7 ainino acid residues. DSPs 

20 areprunarily associated with ceU proliferation and include the ceUcycl^ 
C The phosphatases DUSPl and DUSP2iiiactivate the MAPKfanmlymiKnb 
signal-regulated kinase), JNK (c-Jun N-temunal kinase), and p38 on both tyrosine and threonine 
residues flPROSITB PDOC 00323, supra) . In the activated state, Aese kinases have been iinplicated 
in neuronal diCferentiation, proliferation, oncogenic transformation, platelet aggregation, and 

25 apoptosis. Thus, DSPs are necessary for proper regulation of these processes (Muda, M. et al. 
(1996) J. Biol. Chem. 271:27205-27208). The tumor suppressor PTEN is a DSP that also shows 
lipid phosphatase activity. It seems to negatively regulate interactions with the extracellular matrix 
and wiaintflins sensitivity to apoptosis. PTEN has been implicated in the prevention of angiogmesis 
(Giri, D. and M Ittmann (1999) Hum. Pathol. 30:419-424) and abnormalities in its expression are 

30 associated with numerous cancers (reviewed in Tamura, M. et al. (1999) J. Natl. Clancer Ihst 
91:1820-1828). 

Histidine acid phosphatase (HAP; EXPASY EC 3.1.3.2), also known as add phosphatase, 
.hydrolyzes a wide spectrum of substrates including alkyU aryl, and acyl orthophosphate monoest^ 
and phosphorylated proteins at low pH. HAPs share two regions of conserved sequences, each 
35 cent»:ed around a histidine residue which is involved in catalytic activity. Members of the HAP 
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family include lysosomal acid phosphatase (LAP) and prostatic acid phosphatase (PAP), both 
sensitive to inhibition by Lrtaitiate (PROSITB PDOCXWSSS). 

LAP, an orthophosphoric monoester of the endosomal/lysosomal conq)aitnient is a 
housekeeping gene whose enzymatic activity has been detected in all tissues examined (CSeier, C. et 

5 al. (1989) Bur. J. Biochem. 183:611-616). LAP-deficient mice have progressive skeletal disorder 
and an increased disposition toward generalized seizures (Saftig, P. et al. (1997) J. Biol. Ghent 
272: 18628-1863S). LAP-deficient patients were found to have the following clinical features: 
intermittent vomiting, hypotonia, lethargy, opisthotonos, tmninal bleeding, seizures, and death in 
early infancy (Online Mendelian Inhmtance in Man (OMIM) *2009S0). 

10 PAP, a prostate epithelium-specific differentiation antigen produced by the prostate gland, 

has been used to diagnose and stage prostate cancer, hi prostate carcinomas, the enigmatic activity 
of PAP was shown to be decreased conqmred with normal or benign prostate hypertrophy cells 
CFoti, A. a et al. (1977) Cancer Res. 37: 4120-4124). Two forms of PAP have been identified, 
secreted and intracellular. Mature secreted PAP is detected in the seminal fluid and is active as a 

IS gly^^'^y^^*^ hnmndiiner with a Tnolecnlar weight of qpprpidmately lOQ-kilodalton. Intracellular 
PAP is found to exhibit endogenous phosphotyrosyl protein phosphatase activity and is involved in 
regulating prostate ceD growth (Meng, T.C and Lm, M.F. (1998) J. BioL Oiem. 34: 22096-22104). 

Synaptojanin, a polyphosphoinositide phosphatase, dephosphorylates phosphoinositides at 
positions 3, 4 and 5 of the inositol ring. Synaptojanin is a major presynaptic protein found at 

20 clathrin-coated endocytic intermediates in nerve terminals, and binds the clathrin coat-associated 
piotem, EPSIS. This bmding is mediated by the C4eniiinal region of synaptojanin-170, which has 
3 Asp-Fto-Phe amino add repeats. Further, this 3 residue repeat had bem found to be the binding 
site for the EH domains of EPS15 (Haffiaer, C. et aL (1997) FEBS Lett 419: 175-180). Additionally, 
synaptojanin may potentially regulate interactions of endocytic proteins with the plasma membrane, 

25 and be involved m synaptic vesicle recycling (Brodin, L. et al. (2000) Curr. Opin. Neurobiol. 
10:312-320). Studies m mice witii a targeted disruption m the synaptoganin 1 gene (Synj 1) were 
shown to suppoit coat foonation of endocytic vesicles more efiectivdy than was sera in wild-type 
mice, sug^ting that Synjl can act as a negative regulator of memibrane-coat protein interactions. 
Hiese finding s provide graetic evidence for a crucial role of phosphoinositide m^abolism in 

30 synaptic vesicle recycling (Cremona, O. et al. (1999) CeD 99:179-188). 

The discovery of new kinases and phosphatases, and the polynucleotides encoding them, 
satisfies a need in the art by providing new conq>ositions which are useful in the diagnosis, 
preventicm, and treatment of cardiovascular diseases, immune system disorders, neurological 
disorders, disorders affecting growth and development, lipid disordm, ceU proliferative disordm, 
35 and cancers, and in the assessmrait of the effects of exogenous conqiounds on the expression of 
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SUMMARY OF TBDE INVENTION 
The invention featuies purified polypeptides, kinases and phosphatases, m&md to 

5 collectively as ''KAF' and uidividually as 'TKAP-1." 'TKAP-Z." ••KAP-3." 'TKAPA" **KAP-5;' 
••KAP-6," 'TKAP-T," **KAP-8," 'XAP-9," •'KAP-10," ^TKAP-ll," 'TKAP-ll." •'KAP-IS." 'TKAP- 
14," "KAP-15," •TECAP-ie," '*KAP-17." 'TECAP-IS," **KAP-19," and •TKAP-ZO." In one aspect, the 
inv^tion provides an isolated polypeptide selected from the group consisting of a) a polypq>tide 
con^nising an annno acid sequence selected from the group consisting of SEQ ID NO: 1-20, b) a 

10 polypeptide conqyrismg a naturally occurring amino acid sequrace at least 90% identical to an 
amino acid sequence selected from the groiq) coosistii^ of SEQ ID NO:1-20, c) a biologically active 
fragment of a polypeptide having an amino add sequence selected from the group consisting of 
SEQ ID NO: 1-20, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group ccmsistiiig of SEQ ID NO:1-20. hi one alternative, the invention provides 

15 an isolated polypeptide conqirising the amino add sequoice of SEQ ID NO:1"20. 

The inv^iticHi further provides an isolated polynucleotide encoding a polyp^tide selected 
from die groiq> consisting of a) a polypeptide comprising an amino add sequence selected from die 
group consisting of SEQ ID NO: 1-20, b) a polypeptide conqirising a naturally occurring amino add 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of 

20 SEQ ID NO: 1-20, c) a biologically active fragment of a polypq>tide having an ammo acid sequence 
selected .from the groiq) consisting of SEQ ID NO: 1-20, and d) an immunogenic fragment of a 
polypeptide havmg an ammo add sequence selected from the groiq> consisting of SEQ ID NO: 1-20. 
In one altmiative, the polynucleotide encodes a polypeptide selected from the groiq> consisting of 
SEQ ID NO:1-20. hi another alternative, the polynucleotide is selected from the group consisting of 

25SEQIDNO:21-40. 

AdditicmaUy, the invention provides a recombinant polynucleotide conqnising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino add sequence selected from the group 
consistiog of SEQ ID NO:1-20, b) a polypeptide conqnismg a naturally occutrii^ ammo add 

30 sequence at least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO:1-20, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from tl^ group consisting of SEQ ID NO: 1-20, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-20. 
In one altmiative, the invention provides a cell transf onned with the recombinant polynucleotide, 

35 In another alternative, the invention provides a transgenic organism comprising the recombinant 



18 



wo 02/46384 



PCT/USOl/47431 



polynucleotide. 

The invention also provides a nietbod for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO:1-20, b) apolyp^tide comprising a naturally occurring amino acid 

5 sequence at least 90% identical to an amino acid sequence selected from the groiq> consisting of 
SEQ ID NO: 1-20, c) a biologically active fragment of a polypeptide having an amnio acid sequence 
selected from the group consistmg of SEQ ID NO: 1-20, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-20. 
The method conqxrises a) culturing a cell under conditions suitable for expression of the 

10 polypeptide, wherein said cell is transfonned with a recombinant polynucleotide comprising a 
promote sequence operably Imked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

AdditionaUy, the invention provides an isolated antibody which specifically Innds to a 
polypeptide selected from the ffonp consisting of a) a polypeptide canqxrising an amino add 

15 sequence selected from the g^up consistiiig of SEQ n> NO: 1-20, b) a polypeptide compdsu^ 
naturally- occurring ammo acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-20, c) a biolo^cally active fragment of a polypeptide 
having an amino add sequence selected fixim the groiqp consisting of SEQ ID NO:1-20, andd) an 
immunogenic fragment of a polypq>tide having an amino acid sequence selected from the group 

20 consisting of SEQ ID NO: 1-20. 

The mvention further provides an isolated polynucleotide selected from the groiq> 
consisting of a) a polynucleotide comprismg a polynucleotide sequence selected from the groiq) 
consisting of SEQ ID NO:21-40, b) a polynucleotide conqxrising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 

25 gioup consisting of SEQ ID N0:21^, c) a polynucleotide conqxlementaiy to tihe polynucleotide of 
a), d) a polynucleotide complem^iitaiy to the polynucleotide of b), and e) an RNA equivalent of a>- 
d). In one alternative, the polynucleotide conqnises at least 60 contiguous nucleotides. 

AdditionaUy, the invention provides a method for detecting a target polynucleotide in a 
sanqxle, said target polynucleotide having a sequence of a polynucleotide selected from the group 

30 consisting of a) a polynucleotide conqnising a polynucleotide sequence selected from the gcoxxp 
consisting of SEQ ID NO:21-40, b) a polynucleotide conqxrising a naturally occurring 
polynucleotide sequence at least 90% identical to a polynucleotide sequence selected from the 
groiqi consisting of SEQ ID NO:21-40, c) a polynucleotide conqxlementary to the polynucleotide of 
a), d) a polynucleotide conqxlementaiy to the polynucleotide of b), and e) an RNA equivalent of a)- 

35 d). The method conqxrises a) hytmdizmg the sample with a probe conqnising at least 20 contiguous 
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nucleotides conqpiising a sequence con[q>Iementaiy to said target polynucleotide in the san^Ie, and 
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragments 
thereof, and b) detecting the presence or absence of said hybridization conq>lex, and optionally, if 
5 present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

The mvention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequ^ce of a polynucleotide selected from the group consisting 
of a) a polynucleotide CQnq>rising a polynucleotide sequence selected from the ffovp consisting of 

10 SEQ ID NO:21-40, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:21-40, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an KNA equivalent of a)-d). The method 
conoprises a) amplifying said target polynucleotide or fcagjaieat thereof usmg polymerase chain 

IS reaction anq^lification, and b) detecting the presence or absence of said anq>lified target 
polynucleotide or fragment thmof , and, optionally, if preset, the amount thereof 

The mvention Anther provides a composition con^rising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide conqmsing an amino acid 
sequence selected from the group consisting of SEQ ID NO:1-20, b) a polypeptide conq[)rising a 

20 naturally occurrnig amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-20, c) a Hologically active frragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-20, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting ofSEQ ID NO: 1-20, and a pharmaceutica]lyaccq)tableexcipimt. In one embodiment, 

25 the composition conq>rises an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-20. The invention additionally pDvides a method of treating a disease or condition associated 
with decreased expression of functional KAP, comprising administering to a patient in need of such 
treatmi^t tl^ composition. 

The invention also provides a method for screening a compound for effectiveness as an 

30 agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
add sequmce selected from the group consisting of SEQ ID NO: 1-20, b) a polypeptide con^rising 
a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consistmg of SEQ ID NO: 1-20, c) a biologically active fragment of a polypeptide 
having an amino add sequence selected from the group consisting of SEQ ID NO: 1-20, and d) an 

35 immunogenic fragment of a polypq>tide having an amino acid sequence selected from the group 
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CQDsistmg of SEQ ID NO: 1-20. The metbod cQnq)]ises a) e;q)osing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alt^native, the 
invention provides a conq>osition conqnising an agonist compound identified by the method and a 
phannaceutically acceptable exdpient hi another alternative, the invention provides a method of 

5 treating a disease or condition associated with decreased expression of functional ElAP, conqirising 
administering to a patient in need of such treatment the CQnqK>siti(Hi. 

Additionally, tiie invention provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected fix>m the groiqp consisting of a) a polypeptide comprismg 
an anuno add sequence selected from the group consisting of SEQ ID NO: 1-20, b) a polypeptide 

10 conqnising a naturally occurring amino acid sequence at least 90% identical to an amino acid 
sequence selected from the groiq) consisting of SEQ ID NO: 1-20, c) a biologically active fragment 
of a polypeptide having an amino acid sequence selected from the groiq> consisting of SEQ ID 
N0:l*>20, and d) an immunogenic fragment of a polypeptide having an amino add sequmce selected 
from the group consisting of SEQ ID NO:1-20. The nsethod comprises a) exposing a sanq>le 

IS comprising the polypeptide to a compound, and b) detecting antagCHiist activity in the sample. In 
one alternative, tiie invention provides a conq)Osition ccnnprising an antagonist CQnqx>und idoitified 
by the method and a phaimaceutically acceptable excipient In another alternative, the invention 
provides a method of tieattng a disease or condition associated with overexpression of functional 
KAP, comprising administering to a patient in need of such treatment the conoposition. 

20 The invmtion further provides a method of screening for a compound that specifically 

binds to a polypeptide selected from the group consisting of a) a polypeptide conqnising an amino 
acid sequence selected from the group consistmg of SEQ ID NO: 1-20, b) a polypeptide conqxrising 
a naturally occurring amino add sequence at least 90% identical to an amino add sequence selected 
from the group counting of SEQ ID NO: 1-20, c) a biologically active fragment of a polypeptide 

25 having an amino add sequence selected fit>m the group ccmsisting of SEQ ID NO: 1-20, and d) an 
immunogenic fragment of a polypeptide havtag an amino add sequence selected from the group 
consisting of SEQ ID NO: 1-20. The method comprises a) combining the polypeptide with at least 
erne test conqx>und undor suitable conditions, and b) detectmg Unding of the polypeptide to the test 
conq[K>und, thereby identifying a compound that specifically binds to the polypeptide. 

30 The invaition further provides a mediod of screenii^ for a compound that modulates the 

activity of a polypeptide selected from the groiq> consisting of a) a polypq>tide conqsrising an ammo 
acid sequence selected from die group consisting of SEQ ID NO:1-20, b) a polypeptide conqirising 
anaturally occurring ammo add sequoice at least 90% identical to an amino add sequrace selected 
from the group consistmg of SEQ ID NO: 1-20, c) a biolo^cally active fiagment of a polypeptide 

35 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-20, and d) an 
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iTrnwi mnggnic firagDoeut of a polypeptide having an amino acid sequence selected fiom ^ group 
consisting of SEQ ID NO: 1-20. The method conq>ii5es a) combining the polypeptide with at least 
one test conopound under conditions pennissive for the activity of the polypqptide, b) assessing the 
activity of the polypeptide in the presence of the test compound, and c) comparing the activity of 

5 the polypeptide in the presence of the test compound with the activity of the polypeptide m the 
absence of the test confound, wherein a change in the activity of the polypeptide in the presence of 
die test conq)ound is kidicative of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveoiess in 
altering expression of a target polynucleotide, wherein said target polynucleotide conqmses a 

10 polynucleotide sequ^ce selected £rom the group consisting of SEQ ID NO:21-40, the method 
coniprising a) e;qK>sing a sample comprising the target polynucleotide to a compound, b) detecting 
alt«?ed expression of the target polynucleotide, and c) comparing the expression of the target 
polynucleotide in die presence of varying amounts of the compound and in the absence of the 
compound. 

IS The invention furdier provides a mediod for assessing toxicity of a test compound, said 

method campxisang a) treating a biolo^cal sample containing nucleic acids with the test compound; • 
b) hybridizing die nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the groiq> consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected fiom the group consisting of SEQ ID NO:21-40, ii) 

20 a polynucleotide compising a naturally occurring polynucleotide sequence at least 90% idratical to 
a polynucleotide sequence selected from die groiq> consisting of SEQ ID NO:21-40, iii) a 
polynucleotide having a sequence conq>lementary to i), iv) a polynucleotide conq>lementary to the 
polynucleotide of ii), and v) an RNA equivalent of i>iv). Hybridization occurs undesr conditicms 
whereby a specific hybridization complex is formed betwera said probe and a target polynucleotide 

25 in the Inolo^cal sample, said targ^ polynucleotide selected firom the group conmtmg of i) a 

polynucleotide comprising a polynucleotide sequence selected from the groiq> consisting of SEQ ID 
NO:21-40, ii) a polynucleotide coixQ>rising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from die group consisting of SEQ ID NO:21- 
40, iii) a polynucleotide coniq>lementary to die polynucleotide of i), iv) a polynucleotide 

30 conqplementary to die polynucleotide of ii), and v) an RNA equivalent of i)-iv). Altematively, the 
target polynucleotide conpdses a fragment of a polynucleotide sequence selected fiom the group 
consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) coiqparing die 
amount of hybridization conoplex in the treated biological sanq>le with the amount of hybridization 
conqplex in an untreated biological sanqple, wheran a differ^ce in the amount of hybridization 

35 conqilex m the treated biological sanq>le is mdicative of toxicity of the test compound. 
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BRIEF DESCREPnON OF THE TABLES 
Table 1 summarizes the nomenclatUFe for the fiill length polynucleotide and polypeptide 
sequences of the present invention. 
5 Table 2 shows the GenBank identification number and annotaticm of the nearest GenBank 

homolog for polypeptides of the invention. The probability scores for the matches between each 
polypeptide and its hoznolog(s) are also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 
predicted motifs and donoains, along with the methods, algmtfams, and searchable databases used 
10 for analysis of the polypeptides. 

Table 4 lists the cDNA atul/or genomic DNA fragments which were used to assemble 
polynucleotide sequraces of the invention, along with selected fragments of the polynucleotide 
sequences. 

Table S shows die rq;xresentative cDNA library for polynucleotides of the invention. 
15 Table 6 iHX>vides an appendix which describes the tissues and vectors used for construction 

of the cDNA libraries shown in Table S. 

Table 7 shows the tools, programs, and algorithms used to analyze die polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

20 

DESCRIPTION OF THE INVHWION 

Before the present protems, nucleotide sequences, and methods are described, it is 
understood that diis invention is not limited to the particular machines, materials and methods 
described, as these may vary, is also to be understood that the tenninology used herein is for the 
25 purpose of describing particular, raibodiments cnly, and is not intended to limit the scope of the 
preset invention which will be limited only by the appraded claims. 

It must be noted that as used herein and in the appeoded claims, the singular forms ^^a," 
**an," and "ttie** include plural teSesGDCc unless the context clearly dictates otherwise. Thus, for 
exanf>le, a reference to ''a host ceU" includes a plurality of such host cells, and a refi^nce to "an 
30 antibody^ is a reference to one or mine antibodies and equivalents thereof known to those skilled in 
the art, and so f rath. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly undmtood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 
35 herein can be used to practice or test the present mvention, the preferred machines, materials and 
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methods aie bow described. All publications mentioned heiein aie cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 
5 inior invention. 
DEFINITIONS 

''KAP" refers to the amino acid sequences of substantially purified KAP obtained from any 
species, particularly a mammalian species, including bovine, ovme, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or leconibinant 

10 The term "agonist" refers to a molecule which intensifies or noimics the biological activity 

of KAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
conq>ound or composition which modulates the activity of KAP either by directiy interacting witfi 
KAP or by acting on conq>onents of the biological pathway in which KAP participates. 

An "allelic variant" is an alternative form of the gene encoding KAP. Allelic variants may 

15 result from at least one mutation in the nucleic acid sequence and may result in alt^ied mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise 
to allelic variants are generally ascribed to natural deletions, additions, or substitutions of 
nucleotides. Each of these types of changes may occur alone, or in combination with the others, one 

20 or more times in a given sequence. 

"Altered" nucleic acid sequences encoding KAP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as KAP or a 
polypeptide with at least one functional characteristic of KAP. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 

25 of the polynucleotide encoding EAP, and inqnroper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
KAP. The encoded protein may also be "altered,^ and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent KAP. Deliberate amino acid substituticnis may be made on the basis of similarity in 

30 polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the anq)hipatbic nature of the 
residues, as long as the biological or innnunological activity of KAP is retained For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: aspaiagine and glutamine; and serine and threonine. 

35 Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine. 
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isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. Where "amino acid sequrace" is recited to refer to a sequence of a naturally 
5 occulting protein molecule, "amino acid sequence" and like tenns are not meant to limit the ammo 
acid sequence to the conq>lete native amino add sequence associated with the recited protein 
molecule. 

"Aiuplification" relates to the production of additional copies of a nucleic acid sequence. 
An[q>lification is generally carried out using polynimse chain reaction (PGR) technologies well 
10 known in the art 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological 
activity of KAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, 
small molecules, or any other conq)ound or conqK>sition which modulates the activity of KAP either 
by directiy interactmg with KAP or by acting on conqKments of the biological pathway in which 
15 KAP participates. 

The term "antibody" refers to intact imnumoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic 
determinant Antibodies that bind KAP polypeptides can be prepared using intact polypeptides or 
using fragments containing small peptides of interest as the immunizing antigen. The polypeptide 
20 or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from 
the translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if 
desired. Commonly used carriers that are chemically coupled to peptides include bovine serum 
albumin, thyroglobulin, and keyhole linopet hemocyanin (KLH). The coupled peptide is then used 
to immunize the animal. 

25 The term "antigcadc determinant refers to that region of a molecule (i.e., an epitope) that 

makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-^limensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 

30 used to elicit the immune response) for binding to an antibody. 

The term "aptamer^' refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of ligands by Exponential Enrichment), described in U.S. Patent No. 
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 

35 Aptamer compositions may be double-stranded or single-stranded, and may include 
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deoxyribonucleotides, ribcmucleolides, nucleotide derivatives, or otber nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2*-0H group of 
a ribonucleotide may be replaced by 2 -F or 2'-NH2), which may improve a desired property, e.g., 
resistance to nucleases or long^ lifetime in blood. Aptamers may be conjugated to other molecules, 
5 e.g., a high molecular weight carrier to slow clearance of the ^tamer from the circulatory systeia 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo^tivation of a 
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13.) 

The term "intramer^' refers to an aptamer which is expressed in vivo . For example, a 
vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at 
10 high levels m the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Nad Acad. Sci. USA 
96:3606-3610). 

The term "spiegehnef refers to an aptamer which includes L-DNA, Lr-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left4ianded 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 

15 substrates containing right-handed nucleotides. 

The term "antisense" refers to any coroposition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (FNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 

20 sugar groups such as 2 -methoxyethyl sugars or 2 -methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2*-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecides may be produced by any method includmg chemical synthesis or transaiption. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 

25 translation. The designation **negative" or **minus" can refer to the antisense strand, and the 
designation "positive" or "plus** can refer to the sense strand of a reference DNA nobolecule. 

The term "biologically active" refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" or 
"immunogaiic" refers to the capability of the natural, recombinant, or synthetic KAP, or of any 

30 oligopeptide theireof, to induce a specific immune response in appropriate animals or cells and to 
bmd with specific antibodies. 

"G>ixq>leine]itary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its coiiq)lement, 
3'-TCA-y. 

35 A "con^ositlon conqnising a given polynucleotide sequence*' and a ^conq>osition 
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CQmpTising a givm amino acid sequence" refer broadly to any coniposition containing the given 
polynucleotide or amino acid sequence. The composition may conq>iise a dry fmmulation or an 
aqueous solution. Conq>ositions conqirising polynucleotide sequences encoding KAP or fragments 
of KAP may be enq)loyed as hybridization probes . The probes may be stored in fieeze-dried form 
5 and may be associated with a stabilizing agent such as a caibdbydrate. In hybridizations, the probe 
may be deployed in an aqueous solution containing salts (e.g., NaQ), detergents (e.g., sodium 
dodecyl sulfate; SDS), and other conq>onents (e.g., Denhardf s solution, dry milk, sahnon sperm 
DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has heea subjected to 
10 repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit 
(Applied Biosystrans, Foster Gty CA) in the 5' and/or the 3* direction, and resequenced, or which 
has been assembled from one or nxne overlapping cDNA, EST, or genomic DNA fi:agments using a 
computer program for fragm^t assembly, such as the GELVIEW fragment assembly system (GCG, 
Madison WI) or Phrap (University of Washington, Seattie WA). Some sequences have been both 
15 extended and assembled to produce the consensus sequence. 

'"Conservative amino acid substitutions** are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the frmction of 
. die protein is conserved and not signifiicantiy changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are 
20 regarded as conservative amino acid substitutions. 



Qrigmal Residue Conservative Substitution 





Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, Ifis 


25 


Asp 


Asn, Glu 




Cys 


Ala, Ser 




Ghi 


Asn, Glu, His 




Glu 


Asp,Ghi,His 




Gly 


Ala 


30 


His 


Asn, Aig, Ghi, Glu 




He 


Leu, Val 




' Leu 


He, Val 




Lys 


Arg, Ghi, Glu 




Met 


Leu, Be 


35 


Phe 


His, Met, Leu, Tip, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Tip 


Phe, Tyr 




Tyr 


Ifis, Phe, Tip 


40 


Val 


He, Leu, Thr 



Conservative amino add substitutions genocally vtutyttstm (a) die stnictore of the 
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polypeptide backbcme in the area of the substitution, for example, as a beta sheet or alpha helical 
conformation, (b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or 
(c) the bulk of the side chain. 

A "deletion'* refers to a change in the amino acid or nucleotide sequence that results in the 
5 absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for exanq>le, replacement of hydrogen by 
an alkyl, acyl, hydroxy!, or amino groiq>. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunolo^cal function of the natural molecule. A derivative 
10 polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was d^ved. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of gqaerating a 
measurable signal and is covalentiy or noncovalentiy joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, 
15 or absent gene or protein expression, determined by conq>aring at least two different sanoples. Such 
con^arisons may be carried out between, for exanqile, a treated and an untreated sanq>le, or a 
diseased and a normal sanq>le. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins may be 
20 assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique porticm of KAP or tte polynucleotide encoding KAP which is 
identical in sequence to but shorter in l^gth than the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
25 exa^^)le, a fi^gment may compiise from 5 to 1000 contiguous nucleotides or amino acid residues. 
A fiugment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at 
least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or 
amino acid residues in length. Fragments may be preferentially selected from certain regions of a 
molecule. For example, a polypeptide fragment may conqnise a certain length of contiguous anoino 
30 acids selected from the first 250 or 500 amino acids (or first 25% or 50%) x)f a polypeptide as 
shown m a certain defined sequence. Qearly these Imgths are exeniplary, and any length that is 
supported by the specification, including the Sequence Listing, tables, and figures, may be 
enconq)assed by the present embodiments. 

A fragn^nt of SEQ ID NO:21-40 conq)rises a region of unique polynucleotide sequence 
35 that specifically identifies SEQ ID NO:21-40, for exanq)le, as distinct from any other sequence m 
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the genome from which the fragment was obtained. A fragment of SEQ ID NO:21-40 is usefiil, for 
exanq)Ie, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:21-40 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:21-40 and the region of SEQ ID NO:2I-40 to which the fitagmmt ccnesponds aie routinely 
5 determinable by one of oidinaiy skill in the ait based on ttie intmded purpose for the fragmrat 
A fragment of SEQ ID NO: 1-20 is encoded by a fragment of SEQ ID NO:21-40. A 
fragment of SEQ ID NO:1-20 comprises a region of unique amino acid sequence that specifically 
identifies SEQ ID NO: 1-20. For exanople, a fragmrat of SEQ ID NO: 1-20 is usefid as an 
immunogenic peptide for the development of antibodies that spedficaUy recognize SEQ ID NO:l- 
10 20. The precise length of a fragment of SEQ ID N0:l"20 and die region of SEQ ID NO:1-20 to 
which the fragment corresponds are routinely determinable by one of ordinary skill in (he art based 
on the intended purpose for the fragment 

A '*full length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading firame and a translation termination codon. A 
15 'full length" polynucleotide sequence encodes a '*full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between 
two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms **percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
. 20 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being conq)ared in order to optimize alignment between two sequences, and 
■ therefore achieve a more meaningful conq>arison of the two sequences. 

Paicent identity between polynucleotide sequences may be determined usmg the default 
parameters of the CLUSTAL V algoritinn as incorporated into the MEGALIGN version 3. 12e 
25 sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
ffiggins, D.G. and PM. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) 
CABIOS 8:189-191. For pairwise aligoments of polynucleotide sequences, the default parameters 
are set as follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" 
30 residue weight table is selected as the default Percmt identity is reported by CLUSTAL V as the 
"percent similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence con^arison 
algorithms is provided by the National Center for Biotechnology Information (NC3BI) Basic Local 
Alignment Search Tool (BLAST) (Altschul, S.R et al. (1990) L Mol. Biol. 215:403-410), which is 
35 available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
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http://www.iicbi.nIm.nih.gov/BLASTA The BLAST softwaie suite includes various sequence 
analysis programs including 't^lastn/' that is used to align a known polynucleotide sequence with 
othCT polynucleotide sequences from a variety of databases. Also available is a tool called 'BLAST 
2 Sequences" that is used for diiect pairwise comparison of two nucleotide sequences. "BLAST 2 
5 Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/goi:£1)12.html. 
Hie "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are conmionly used with g^ and other parameters set to defoult settmgs. For exanqple, to 
conq)are two nucleotide sequences, one may use blastn with the "'BLAST 2 Sequences" tool 
Version 2.0. 12 (April-21-2000) set at default parameters. Such default parameters may be, f cxr 
10 example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty formismatch: -2 

Open Gap: 5 and Extension Gqp: 2 penalties 
15 Gap X drop-cff: 50 

Expect: 10 

WonlSize: 11 

Fiber: on 

Percent identity may be measured over the length of an entire defined sequence, for 

20 exanq)le, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
exan^le, over the length of a fiagment taken from a larger, defined sequence, for instance, a 
firagment of at least 20, at least 30, at least 40, at least SO, at least 70, at least 100, or at least 200 
contiguous nucleotides. Such lengths are exen^ilary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be 

25 used to describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that 
changes in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic 
acid sequences that all encode substantially the same proteiiL 

30 The phrases '"percent identity" and **% identity," as applied to polypeptide sequences, refer 

to the percentage of residue matches between at least two polypq)tide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and.hydrophobicity at 

35 the site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 
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Peicrat identity betwera polypeptide sequences may be detetnuned using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGAIiGN version 3.12e 
sequence alignment program (described and referraced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, tte default parameters are set as follows: Kttq>le=l, gap 
5 penalty=3, window=S, and ''diagonals saved"=^. The PAM2S0 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the parent idratity is reported by 
CLUSTAL V as the ''percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
10 2.0.12 (April-21-2000) with blas^ set at default parametm. Such default parameters may be, for 
exanq)le: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap X drop-off: 50 
15 Ejq>ect: 10 

Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for exanq>Ie, as defined by a particular SEQ ID number, or may be measured over a shorter length, 

20 for example, over the length of a fragm^t taken from a larger, defined polypeptide sequence, for 
mstance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at 
least 150 contiguous residues. Such lengths are exemplary only, and it is understood that any 
firagment length supported by the sequences shown h^in, in the tables, figures or Sequence 
Listing, may be used to describe a length over which percentage identity may be measured 

25 "Human artificial chromosome" (HACs) are linear microchromosomes which may contain 

DNA sequences of about 6 kb to 10 Mb in size and which contain aU of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 

30 resembles a human antibody, and still retains its original binding ability. 

'Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand Aiough base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of 
conq>lementarity. Specific hybridization conq)lexes form under permissive annealing conditions 

35 and remain hybridized after the "washing" step(s). The washing $tep(s) is particularly important in 
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detennming the stringency of the hybiidizatiQn process, with more stringent conditions allowing 
less non-specific binding, i.e., binding betwera pairs of nucleic acid strands that are not perfectly 
matched Permissive conditions for annealing of nucleic acid sequences are routinely determinable 
by one of ordinary skiU in the art and may be consistrat among hybridization experiments, whereas 
5 wash conditions may be varied among e;q)eriments to achieve the desired stringency, and therefore 
hybridization specificity. Permissive annealing conditions occur, for exanq>le, at 68^C in the 
presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 ^g/ml sheared, denatured salmon 
sperm DNA. 

Graerally, stringracy of hybridization is expressed, in part, with reference to the 

10 ten^erature under which the wash step is carried out. Such wash temperatures are typically 

selected to be about S'^C to 20°C lower than the thennal melting point (T J for the specific sequence 
at a defined ionic strength and pH. The T„ is the temperature (under defined ionic strength and pS) 
at wMch 50% of the target sequ^ce hybridizes to a perfectly noatched probe. An equation for 
calculating T„ and conditions for nucleic add hybridization are well known and can be found in 

15 Samhrook. J. et aL (1989) Molecular Q ^ninpr- A T A boratorv Manual 2°* ed., vol. 1-3. Cold Spring 
Harbor Press, Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 
for 1 hour. Alternatively, tCTq)eratures of about 65°C, 60**C, 55^C, or 42**C may be used SSC 

20 concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non*-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 /ig/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under 
particular circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash 

25 conditions will be readily apparent to those of ordinary skill in the art Hybridization, particularly 
under high stringency conditions, noay be suggestive of evolutionary similarity between the 
nucleotides. Such similarity is strongly indicative of a similar role for the nucleotides and their 
encoded polypeptides. 

The term '^hybridization complex** refers to a complex fonned betwera two nucleic add 

30 sequences by virtue of the fmnation of hydrogen bonds between complementary bases. A 

hybridization con^lex may be fonned in solution (e.g., Qt or R^t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have ho&a fixed). 

35 The words "insertion** and ''addition'* refer to changes in an amino acid or nucleotide 
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sequence lesulting in the addition of one or more annno acid residues or nucleotides, respectively. 

'Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or gpnetic disease, etc. These conditions can be characterized by expression 
of various fectors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
5 cellular and systemic defense systems. 

An "immunogenic fragment" is a polypq[>tide or oligopeptide fragment of KAP which is 
capable of eliciting an immune re^cmse when introduced into a living organism, for exaiq)le, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide 
fragment of KAP which is useful in any of the antibody pioduction mediods disclosed herdn or 
10 known in the art 

The term "microanay" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or oth^ chemical compounds on a substrate. 

The terms '*elemenf and "array element*' refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microairay. 
15 The tenn ''modulate" refers to a change in die activity of KAP. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of KAP. 

The phrases **nucleic acid" and **nucleic acid sequence" refer to a nucleotide, 
oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA 
20 of genomic or synthetic origin which may be single-stranded or double-stranded and may represent 
the sense or the antisense strand, to peptide nucleic add (PNA), or to any DNA-like or RNA-like 
material. 

"Op^bly linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
. 25 linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in tiie same reading frame. 

'Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene ag&nt which 
conqnises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
30 of amino acid residues ^ding in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind conq)lementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

Tost-translational nK>dificatiGn" of an KAP may involve lipidation, glycosylation, 
phosphorylation, acetylation, ral:emization, proteolytic cleavage, and other modifications known in 
35 the art Theseprocessesmay occur syntheticaUy or biochemicaUy. Biochemical modifications will 
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vary by cell type depending on the enzymatic milieu of KAP. 

'"Probe" refers to nucleic add sequences encoding KAP, their conq)lenients, or firagn^nts 
thereof, which are used to detect identical, allelic or related nucleic add sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 

S Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
'Trimers" are short nucleic adds, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by conq)lementary base-pairing. The primer may then be extended along the taiget 
DNA strand by a DNA polymerase enzyme. I^imer pairs can be used for anq>lification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PGR). 

10 Probes and primers as used in the present invention typically conq>rise at least IS 

contiguous nucleotides of a known sequence. In ord^ to enhance specifidty, long^ probes and 
primers may also be employed, such as probes and primers that comprise at least 20, 2S, 30, 40, SO, 
60, 70, 80, 90, 100, or at least ISO consecutive nucleotides of the disclosed nucleic add sequences. 
Probes and primers may be considerably longer than these exanqiles, and it is understood that any 

IS length supported by the specificaticm, including the tables, figures, and Sequence Listing, may be 
used. 

Methods for preparing and using probes and priroers are described in the references, for 
exan5>le Samhrook, J. et al. (1989) Molecular C lnTiiTip;! A Laboratory Manual. 2"* eA, vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, FJM. et al. (1987) Current Protocols in Molecular 
20 Biolojgy. Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PGR 
Protocols. A Guide to Methods and Applications. Academic Press. San Diego CA. PGR primer 
pairs can be derived from a known sequ^ce, for exanople, by using conq>uter programs intended for 
that purpose such as Primer (V^on 0.S, 1991, Whitehead Institute for Biomedical Research, 
Cambridge MA). 

2S Oligonucleotides for use as prixu^ are selected using software known in the art for such 

purpose. For exanqile, OLIGO 4.06 software is useful for the selection of PGR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of iq) to 
5,000 nucleotides fit)m an input polynucleotide sequence of up to 32 Idlobases. SimOar primer 
selection programs have incorporated additional features for expanded capabilities. For exan^le, 

30 the PrimOU primer selection program (available to the public from the Genome Center at University 
of Texas South West Medical Center, Dallas TX) is capable of choosing specific primers fix)m 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The PrimerS 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 
Genome Research, Cambridge MA) allows the user to input a '*mispriming library," in which 

35 sequences to avoid as primer binding sites are user-specified. PrimerS is useful, in particular, for 



34 



wo 02/46384 



PCT/USOl/47431 



the selection of oligonucleotides for micioaiTays. (The source code for the latter two primer 
selection programs may also be obtained from their respective sources and modified to meet the 
user' s specific needs.) The PrimeGm program (available to the public fix)m the UK Human 
Graonoe Mapping Project Resource Centre, Cambridge UK) designs primers based on multiple 
5 sequence alignments, thereby allowing selection of primers that hybridize to either the most 
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is 
useful for identification of both unique and conserved oligonucleotides and polynucleotide 
fragments. The oligonucleotides and polynucleotide fragmrats identified by any of the above 
selection methods are usefid in hyt»idization technologies, for exanq)le, as PCR or sequencing 

10 primers, microarray elements, or specific probes to identify fully or partially conq)lenientary 
polynucleotides in a san^le of nucleic acids. Methods of oligonucleotide selection are not limited 
to those described above. 

A '^ombinant nucleic acid" is a sequence that is not naturally occuiring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segnients of sequence. 

15 This artificial combination is often accon^>Ii5hed by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, 
a recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 

20 sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g,, based on a 
vaccinia virus, that could be use to vaccinate a Tnamnial wherein the recombinant nucleic acid is 
eiqiressed, inducing a protective immunological response in the maTmnal- 
25 A '^regulatory element*' ref^s to a nucleic acid sequence usually derived from untranslated 

regions of a gene and includes enhancers, promoters, introns, and 5* and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

'^Reporter molecules** are chemical or Inochemical moieties used for labeling a nucleic acid, 
30 ammo add, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 

chemiluminesc^t, or chromogauc agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known m the art 

An **RNA equivalent," m refermce to a DNA sequence, is coniposed of the same linear 
sequence of nucleotides as the ref erraice DNA sequence with the exception that all occurrences of 
35 the nitrogenous base thymine are replaced with uracil, and the sugar backbone is conq)05ed of 



35 



wo 02/46384 



PCT/USOl/47431 



ribose instead of deoxyribose. 

The torn "sample'* is used in its broadest sense. A sanqyle suspected of containing KAP, 
nucleic acids encoding KAP, or fragnmts thereof may conq>rise a bodily fluid; an extract from a 
cell, chromosome organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA« or 
5 cDNA, in solution or bound to a substrate; a tissue; a tissue piint; etc. 

The t^cms ^'specific binding*' and ''specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic bmding conqposition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the bmding 
10 noolecule. For example, if an antibody is specific for epitope ''A," the presence of a polypeptide 
comprising the q>itppe A, or the presence of free unlabeled A, in a reaction containing fiee labeled 
A and the antibody will reduce the amount of labeled A that bmds to the antibody . 

The term "substantially purified" refers to nucleic acid or amino acid sequmces that are 
removed from their natural environment and are isolated or separated, and are at least 60% fiee, 
15 preferably at least 7S% free, and most preferably at least 90% free fix>m other conqKments with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nuclieotides 
. by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
20 chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, chaimels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 
expression by a particular cell type or tissue under given conditions at a given time. 
25 'Transformation" describes a process by which exogenous DNA is introduced into a 

recipient cell. Transformation may occur under natural or artificial conditions according to various 
methods well known in the art, and may rely on any known method for the ins^on of foreign 
nucleic acid sequences into a prokaryotic or eukaryotic host cell. The n^thod for transformation is 
selected based on the type of host cell being transformed and may include, but is not limited to, 
30 bacteriophage or viral infection, electroporation, heat shock, lipofection, and particle bombardment 
The term "transformed cells" includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromoson^, as well as transiently transfcomed cells which express the inserted DNA or RNA for 
limited periods of time. 

35 A "transgenic organism," as used herein, is any organism, including but not limited to 
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animals and plants, in which one or more of the cells of the organism contains het^logous nucleic 
add introduced by way of human intervention, such as by transgenic techniques well known in the 
art Thenucleicaddisintroducedintothecell,directly or indirectly by introduction into a 
precursor of the cell* by way of deliberate g^etic manipulation, such as by microinjection or by 
5 infection with a recombinant virus. Tte term genetic manipulation does not include classical 
cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant 
DNA molecule. The transgenic organisms contemplated in accordance with the present invention 
include bacteria, cyanobactma, fungi, plants and animals. The isolated DNA of the present 
invention can be introduced into the host by n^ods known m the art, for exan^le infection, 

10 transfection, transfDnnation or transconjugation. Techniques for transferring the DNA of the 
present invration into such organisms are widely known and provided in references such as 
Sambrook et al. (1989), supnb 

A ^Variant" of a particular nucleic add sequence is defined as a nucleic add sequence 
having at least 40% sequmce identity to the particular nucleic acid sequence ovra- a certain lengdi of 

15 one of the nucleic acid sequences using blastn with the **BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at 
least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 

{ 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 
99% or greater sequence identity ovct a certain defined l^gth. A variant may be described as, for 

20 exan:^le, an "allelic" (as defined above), "splice," "species," or •'polymorphic" variant A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or 
lessCT number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional fimctional domains or lack domains that are 
present in the reference molecule. Spedes variants are polynucleotide sequences that vary fiom one 

25 species to another. The resulting polypeptides will generally have significant amino acid identity 
relative to each other. A polynodphic variant is a variation in the polynucleotide sequence of a 
particular gene betwe^ individuals of a given spedes. Polymorphic variants also may encompass 
"single nucleotide polymorphisms" (SNPs) in which the polynucleotide sequence varies by one 
nucleotide base. The presence of SNPs may be indicative of, for example, a certain population, a 

30 disease state, or a propensity for a disease state. 

A **variant" of a particular polypeptide sequence is defined as a polypeptide sequwice 
having at least 40% sequence identity to the particular polypeptide sequence over a certain length of 
<Mie of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) set at default parameters. Such a pair of polypeptides may show, for example, at 

35 least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 
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93%. at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length of one of the polypeptides. 

TBOB INVENTION 

5 The invCTtion is based on the discovery of new human kinases and phosphatases (KAP), the 

polynucleotides encoding KAP, and the use of these compositions for the diagnosis, treatment, or 
prevention of cardiovascular diseases, immune system disorders, neurological discnders, disorders 
affecting growth and development, lipid disordm, cell proliferative disorders, and cancers. 

Table 1 summarizes the ncHnenclature for the full length polynucleotide and polypeptide 

10 sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
a single Licyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
Incyte polypeptide sequence nmnber (Incyte Polypeptide ID) as shown. Each polynucleotide 
sequence is denoted by both a polynucleotide sequence identification nmnber (Polynucleotide SEQ 

15 ID NO:) and an Incyte polynucleotide cons^isus sequence number Qncyte Polynucleotide ID) as 
shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified 
by BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 
polypeptide sequence identification numb^ (Polypeptide SEQ ID NO:) and the corresponding 

20 Incyte polypq)tide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. 
Column 3 shows the GenBank identification numb^ (GenBank ID NO:) of the nearest GenBank 
homolog. Colunon 4 shows the probability scores fox the matches between each polypeptide and its 
homolog(s). Column 5 shows the aimotation of the GenBank homolog(s) along with relevant 
citations where applicable, all of which are expressly incorporated by referraice herein. 

25 Table 3 shows various structural features of the polypeptides of the invention. Columns 1 

and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (facyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as det^mined by the 

30 MOTIFS program of the GCG sequ^ce analysis software package (Gaieties Con^iuter Group, 
Madison WI). Column 6 shows amino acid residues conq>rising signature sequences, domains, and 
motifs. Column 7 shows analytical methods for protein structure/fimction analysis and in some 
cases, searchable databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and 

35 these properties establish that the claimed polypeptides are kinases and phosphatases. For exanople. 
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SEQ ID NO: 1 is 79% identical to rat protein tyrosine phosphatase TD14 (G^ask ID g3598974) as 
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 0.0, which indicates the probability of obtaining the observed polypeptide 
sequence aUgnment by chance. SEQ ID NO: 1 also contains protein-tyrosine phosphatase domain as 
5 determined by searching for statistically significant matches in the hidden Markov model (HMM)- 
based FFAM database of conserved protem family domaiiis. (See Table 3.) Data from BLIMPS, 
PROFDLESCAN and MOTIFS analyses provide further conoborative evidence that SEQ ID NO: 1 is 
a protein-tyrosine phosphatase. 

la an alternative example, SEQ ID N0:3 is 34% identical to Faeus svlvatica protem 

10 phosphatase 2C (PP2C GenBank ID g77681Sl) as d^ermmed by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 6.4e-17, which mdicates the 
probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID N0:3 
also shares 45% identity with a putative Caenorhabditis eleeans PP2C (GenBank ID g2804429), 
based on BLAST analysis, with a probability score of 2.4e-71 . SEQ ID NO:3 contains protem 

15 phosphatase 2C domains as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 
Table 3.) Data from BLIMPS analysis provide further corroborative evidence that SEQ ID NO:3 is 
a protein phosphatase 2C. 

In an alternative exanogple, SEQ ID NO:5 is 25% identical to human protein kinase PAK5 

20 (GenBank ID g7649810) as detramined by the Basic Local Alignment Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is 7.2e-14, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO:5 also contains a eukaryotic 
protein kinase domain as determined by searching for statistically significant matches in the hidden 
Maricov model (IIMMH)ased PFAM database of conserved protein family domains. (See Table 3.) 

25 Data from TMAP analysis as weD as BLIMPS and BLAST analyses of the PRODOM and DOMO 
databases provide further corroborative evidence that SEQ ID NO:5 is a membrane-bound kinase. 

In an alternative exanqile, SEQ ID NO:6 is 1511 amino acid residues in length and is 97% 
identical over 1494 residues to hmnan MEK kinase I (GenBank ID g2815888) as determined by the 
Basic Local Alignment Search Tool (BLAST). (See Table 2,) The BLAST probability score is 0.0, 

30 which indicates the probability of obtaining the observed polypeptide sequence aligmmmt by 
chance. SEQ ID NO:6 also contains a eukaryotic protein kinase domain as determined by searching 
for statistically significant matches in the hidden Markov model (HMM)«based PFAM database of 
conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 
PROFELESCAN analyses provide fiirther corroborative evidence that SEQ ID NO:6 is protein 

35 kinase. hi an alternative exan5)le, SEQ ID N0:9 is 87% identical to murine protein kinase 
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(GenBank ID g4060S8) as detennined by the Basic Local Aligmneat Search Tool (BLAST). (See 
Table 2.) The BLAST probability score is 0.0, which indicates the prol^bility of obtaining the 
observed polypeptide sequence alignment by chance. SBQ ID N0:9 also contains an eukaryotic 
protein kinase domain and a FDZ domain as determined by searching for statistically significant 
5 matches in the hidden Markov model (HMM)-based FFAM database of conserved protein family 
domams. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFDJSSCAN analyses p^ 
fiirdier corroborative evidence that SEQ ID N0:9 is a protein kinase. 

hi an alternative exanq)le, SEQ ID NO: 16 is 61% identical to human mitqgen-activated 
kinase tinase kinase 5 (GenBank ID gI679668) as determined by the Basic Local Alignment Search 

10 Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, which indicates the probability 
of obtaining the obs^ed polypeptide sequence alignment by chance. SEQ ID NO: 16 also contains 
a eukaryotic protein kinase domain as determined by searching for statistically significant matches 
in the hidden Markov model (HMM)-based PFAM database of conserved protein family domains. 
(See Table 3.) Data ftom BLIMPS, MOTIFS, and PROEILESCAN analyses provide further 

IS corroborative evidence that SEQ ID NO: 16 is a mitogen activated protein kinase kinase kinase. 

In an altemative exanq)le, SEQ ID NO: 18 is 83% identical fix)m residues 4 to 372 to mouse 
protein kinase (GenBank ID g4060S8) as detennined by the Basic Local Alignment Search Tool 
(BLAST). (See Table 2,) The BLAST probability score is 0.0, which indicates the probability of 
obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 18 also contains a 

20 eukaryotic protein kinase domain and a PDZ domain as determined by searching for statistically 
significant matches in the hidden Markov model (HMM>based PFAM database of conserved 
protein family domains. (See Table 3.) Data fix)m BLIMPS, MOTIFS, and PROFILESCAN 
analyses provide further corroborative evid^ce that SEQ ID NO: 18 is a serine/threonine protein 
kinase. 

25 In an altemative example, SEQ ID NO: 19 is 95% identical, from residue Ml to residue 

V988, to Rattus norvegicus mytonic dystrophy kinase-related Cdc42-binding kinase (GenBank ID 
g2736151) as determmed by the Basic Local AlignnoMit Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 0.0, which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID N0:19 also contains a protein kinase C 

30 terminal domain and a eukaryotic protein kinase domain as determined by searching for statistically 
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved 
protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and additional BLAST 
analyses provide furdier corroborative evidence that SEQ ID NO: 19 is a protein kinase. 

35 SEQ ID N0:2, SEQ ID NO:4, SEQ ID N0:7, SEQ ID N0:8, SEQ ID NO: 10-15, SEQ ID NO: 17. 
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and SEQ ID NO:20weie analyzed and aimotated in a sixnQarmaim^^ The algcmtfams and 
parameters for the analysis of SEQ ID NO:1-20 are desciibod in Table 7. 

As shomi in Table 4, the full lei^ polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 
5 combination of these two types of sequences. Cohmm 1 lists the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:), the corresponding Mcyte polynucleotide 
consensus sequence number (Mcyte ID) for each polynucleotide of the invention, and the length of 
each polynucleotide sequence in basepairs. Column 2 shows the nucleotide start (S^ and stop (30 
positions of the cDNA and/or genomic sequences used to assemble the full length polynucleotide 

10 sequences of the invention, and of fragments of the polynucleotide sequences which are useful, for 
example, in hybridization or amplification technologies that identify SEQ ID NO:21-40 or that 
distinguish between SEQ ID NO:21-40 and related polynucleotide sequences. 

The polynucleotide fragments described in Column 2 of Table 4 may iiefer specifically, for 
exaisple, to Incyte cDNAs d^ved from tissue-specific cDNA libraries or from pooled cDNA 

15 libraries. Alt^natively, the polynucleotide fragments described in column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the full length polynucleotide sequences. Iq 
addition, the polynucleotide fragments described in column 2 may identify sequences derived from 
the ENSEMBL (The Sanger Centre, Cambridge, UK) database (ie., those sequences including the 
designation "ENST")- Alternatively, the polynucleotide fragments described in column 2 may be 

20 derived from the NCBI RefSeq Nucleotide Sequence Records Database (ic, those sequences 
including the designation **NM** or ^W) or the NCBI RefSeq Protein Sequence Records (Le., 
those sequences including the designation **NFO. Alternatively, the polynucleotide fragments 
described in column 2 may refer to assemblages of both cDNA and Genscan-predicted exons 
brought together by an "exon stitching" algorithm. For exanq>le, a polynucleotide sequence 

25 identified as Y^JOOOODCJ^jJfjjrYYYYJ^sJf^ represents a "stitched" sequence in which 
XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is flje number of the prediction generated by the algorithm, and Nj^2j_y if 
present, represent spedfic exons that may have been manually edited during analysis (See Example 
V). Alternatively, die polynucleotide fi^gm^ts in column 2 may refer to assemblages of exons 

30 brought together by an "exon-stretching** algorithm. For exanq)le, a polynucleotide sequence 
identified as FLXXXXXX^^sMAAA^BBBBJLJf is a "stretched" sequence, with XXXXXX being 
the Incyte project identification number, g^AAAA being the GenBank identification niunber of the 
human genomic sequwice to which the "exon-stretching'* algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 

35 protein homolc^, and N referring to specific exons (See Example V). Jn instances where a RefSeq 
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sequence was used as a protein homolog for the "exon-stretching" algorithin» a RefSeq identifier 
(denoted by •'NM" •*NP," or "NT") may be used in place of the GenBank identifier (Le. . gfiBBBB). 

Alt^natively» a prefix identifies component sequences that were hand-edited, predicted 
fifom genomic DNA sequences, or derived from a conibination of sequrace analysis methods. The 
5 following Table lists examples of component sequence prefixes and corresponding sequence 
analysis methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction fix)m genomic sequences using, for exaiiq)le, 
GENSCAN (Stanf Old University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Exanq>le V). 


INCY 


Full length transcript and exon prediction fiom mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
15 Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant 
Incyte cDNA identification numbers are not shbwn. 

Table 5 shows the representative cDNA libraries for those fidl length polynucleotide 
sequences which were assembled using hicyte cDNA sequences. The representative cDNA library 
is the Eacyte cDNA library which is most fiequentiy represented by the Incyte cDNA sequences 
20 which w^ used to assemble and confirm the above polynucleotide sequmces. The tissues and 
vectors which were used to construct the cDNA libraries shown in Table S are described in Table 6. 

The mvention also mconqkasses KAP variants. A preferred KAP variant is one which has 
at least about 80%, or altmiatively at least about 90%, or even at least about 95% amino add 
sequence identity to the ElAP amino acid sequrace, and which contains at least one functional or 
25 structoral characteristic of KAP. 

The invention also enconqmsses polynucleotides which encode KAP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequrace selected 
fiom the group consisting of SEQ ID NO:21-40, which encodes KAP. The polynucleotide 
sequences of SEQ ID NO:21-40, as presented in the Sequrace listmg, embrace tbe equivalent RNA 
30 sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracfl, and the 
sugar backbone is conq)osed of ribose instead of deoxyribose. 

The mvCTtion also ^icompasses a variant of a polynucleotide sequence encoding KAP. In 
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paiticular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence idmitity to die polynucleotide 
sequence ^coding KAP. A particular aspect of the invention enconq)as5es a variant of a 
polynucleotide sequence conqnising a sequence selected from the group consisting of SEQ ID 
5 NO:21-40 which has at least about 70%, or altmiatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic add sequence selected from the group consisting 
of SEQ ID NO:21-40. Any one of the polynucleotide variants described above can encode an amino 
acid sequ^ice which omtains at least one functional or structural characteristic of KAP. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant 

10 of a polynucleotide sequence encoding KAP. A splice variant noay have portions which have 
significant sequence identity to the polynucleotide sequence encoding KAP, but wiU generally have 
a greater or lesser number of polynucleotides due to additions or deletions of blocks of sequence 
arising from alternate splicing of exons during mRNA processing. A splice variant may have less 
than about 70%, or alternatively less than about 60%, or alternatively less than about 50% 

15 polynucleotide sequence identity to the polynucleotide sequence encoding KAP over its entire 
length; however, portions of the splice variant will have at least about 70%, or alternatively at least 
about 85%, or alternatively at least about 95%, or alternatively 100% polynucleotide sequence 
identity to portions of the polynucleotide sequence encoding KAP. Any one of the splice variants 
described above can encode an amino acid sequence which contains at least one functional or 

20 structural characteristic of KAP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding ICAP, some bearing rniniinal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contenq)lates each and every possible variation of polynucleotide 

25 sequence that could be noade by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring KAP, and all such variations are to be considered as 
being specifically disclosed. 

Although nucleotide sequences which encode KAP and its variants are generally capable of 

30 hybridizing to the nucleotide sequence of the naturally occurring KAP under appropriately selected 
conditions of string^cy, it may be advantageous to produce nucleotide sequences encoding KAP or 
its dmvatives possessing a substantially difierent codcm usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the fiiequency with which 

35 particular codons are utilized by the host Other reasons for substantially altering the nucleotide 
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sequence encoding KAP and its derivatives without altering tibte encoded amino acid sequences 
include the production of RNA transcripts having moie desirable prop^es, such as a greater 
half-life, than transcripts produced from the naturally occurring sequ^ce. 

The invention also ^compasses production of DNA sequences which encode KAP and 
5 KAP derivatives, or firagments thereof, entirely by syntiietic chemistry. Afler production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art Moreov^, synthetic chemistry may be used to 
introduce mutations into a sequence encoding KAP or any fiagment thereof. 

Also encompassed by the invention axe polynucleotide sequences that aie capable of 
10 hybridisdng to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:21-40 and fragments hereof under various conditions of stringency. (See, e.g., Wahl, GM. and 
SX. Berger (1987) Methods Bizymol. 152:399-407; Kimmel, A.R. (1987) Methods Oizymol. 
152:507-5 11.) Hybridization conditions, including annealing and wash conditions, are described in 
••Definitions." 

15 Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embodiments of the invention. The methods may employ such enzymes as the EQenow fiagment 
of DNA polymerase I, SEQUENASE (US Biochemical, Qeveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polyn^erase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofieading exonucleases such as those found in the 

20 ELONGASE arnqplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with maclunes such as the MICROLAB 2200 liquid transfer syst^ 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
CATALYST 800 thermal cycler (Applied Biosystems). Sequencing is thea carried out using eithra* 
the ABI 373 or 377 DNA sequaidng system (Applied Biosystems), the MEGABACE 1000 DNA 

25 sequencing system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art The 
resulting sequences are analyzed using a variety of algorithms which are weU known in the art 
(See, e.g., Ausubel, F.M. (1997) Short Ptotocols in Molecular Biologv. John Wiley & Sons, New 
York NY, unit 7.7; Meyers, RA. (1995) Molecular Bioloev and Biotechnologv, Wiley VCH, New 
York NY, pp. 856-853.) 

30 The nucleic acid sequences encoding KAP may be eluded utilizing a partial nucleotide 

sequence and employing various PCR-based methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For exanq)le, one method which may be 
en^loyed, restriction-^te PGR, uses universal and nested primers to anq>lify unknown sequence 
from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PGR Methods Applic. 

35 2:318-322.) Another method, inverse PGR, uses primers that extend in divergent directions to 
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aisplij^ unknown sequence from a circularized template. The tenqilate is derived from lestriction 
fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et 
al. (1988) Nucleic Adds Res. 16:8186.) A third method, capture PGR, involves PGR anq>lificatiQn 
of DNA fragments adjacent to known sequences in human and yeast artijScial chromosome DNA. 
5 (See, e.g., Lagerstrom. M et al. (1991) FCR Methods Applic. 1:111-119.) In this mediod, multiple 
lestriction enzyme digestions and ligations may be used to insert an en^eeied double-stranded 
sequence into a region of unknown sequ^ice before performing PGR. Other methods which may be 
used to retrieve unknown sequences are known in the art (See, e.g., Parker, J.D. et al. (1991) 
Nucleic Acids Res. 19:3055-3060). Additionally, one may use PGR, nested primers, and 

10 PROMOTERHNDER libraries (Qontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is usefrd in finding intron/exon junctions. For all PGR-based 
methods, primers may be designed using commercially available software, such as OUGO 4.06 
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to 
be about 22 to 30 nucleotides in length, to have a GG contmt of about 50% or more, and to anneal 

15 to the tenq)late at tenoperatures of about 68'*G to 72°G. 

When screening for ftdl length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a fuU-length cDNA. Genomic libraries may be useful for e^ct^ion of 

20 sequence into 5* non-transcribed regulatory regions. 

Gapillary electrophoresis systems which are comno^rcially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PGR products. In particular, capillary 
sequencing may en5)loy flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 

25 emitted wavelengths. Output/light intensity may be conv^ted to electrical signal using appropriate 
software (e.g., (SENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of sanq)les to conq)uter analysis and electronic data display may be computer 
controlled. Gapillary electrophoresis is especially preferable for sequencing snoall DNA fragments 
which may be present in limited amounts in a particular sanq>le. 

30 hi another embodiment of the invention, polynucleotide sequences or fiiagments thereof 

which encode KAP may be cloned in recombinant DNA molecules that direct expression of KAP, 
or fiagments or functicnal equivaloits thereof, in appropriate host cells. Due to the inherent 
deg^eracy of the genetic code, oth^ DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express KAP. 

35 The nucleotide sequences of the present invention can be engineered using methods 
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generally known in the art in order to alter KAP-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or wqpression of the gene 
product DNAshufQing by random fragmentation and POl reassembly of gene fragniente 
synthetic oligonucleotides may be used to engmeer the nucleotide sequences. For exanq>Ie, 
5 oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, 
and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques 
such as MOLECULARBREEDING (Maxygen Inc., Santa Qara CA; described m U.S. Patent No. 

10 5,837,458; Chang, C-C. et al. (1999) Nat Biotechnol. 17:793-797; Christians. RC et al. (1999) 
Nat Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
improve die biological properties of KAP. such as its biological or enzymatic activity or its ability 
to bind to otiier molecules or conqwunds. DNA shuffling is a process by which a library of graie 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 

15 subjected to selection or screening procedures tiiat idmtify tiiose gene variants witii die desired 
properties. These preferred variants may then be pooled and fiirther subjected to recursive rounds 
of DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, firagments of a smgle gene containing 
random point mutations may be recombined, screened, and flien reshuffled until die desired 

20 properties are optimized. Alternatively, fragments of a given gene may be recombined witii 
fragments of homologous genes in die same gene family, eitiier from die same or different species, 
diereby maximizing die genetic diversity of multiple naturally occurring genes in a directed and 
controllable manner. 

In anodier embodiment, sequences encoding KAP may be syndiesized, in whole or in part. 

25 using chemical mediods well known in die art (See, e.g., Carudiers, MJL et al. (1980) Nucleic 
Acids Synq). Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 
Alternatively, KAP itself or a fragment diereof may be syntiiesized using chemical mediods. For 
exanq)le, peptide synthesis can be performed using various solution-phase or solid-phase 
techniques. (See, e.g., Creighton, T. (1984) Protems, Structures and Molecular Properties. WH 

30 fteeman. New York NY, pp. 55-60; and Roberge, J. Y. et al. (1995) Science 269:202-204.) 
Automated synthesis may be achieved using the ABI 431 A pq)tide syndiesizer (Applied 
Biosystems). Additionally, die amino acid sequence of KAP, or any part diereof, may be altered 
during direct syndiesis and/or combined widi sequences from odier proteins, or any part diereof, to 
produce a variant polypeptide or a polypeptide havmg a sequence of a naturally occurring 

35 poIypq)tide. 
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The peptide may be substantially purified by preparative high perfonnance liquid 
chromatography. (See. e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods EnzymoL 182:392-421.) 
The conq)osition of the synthetic peptides may be confinned by amino add analysis or by 
sequencing. (See, e.g., Qneighton, supra, pp. 28-53.) 
5 In order to express a biologically active KAP, the nucleotide sequences encoding KAP or 

derivatives thereof may be inserted mto an appropriate expression vector, Le., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in 

10 polynucleotide sequences encoding KAP. Such elements may vary in their strength and specificity. 
Specific initiation signals may also be used to achieve more efficient translation of sequences 
encoding KAP. Such signals include the ATG initiation codon and adjacent sequences, e.g. the 
Kozak sequence. In cases where sequences encoding KAP and its initiation codon and upstream 
regulatory sequences are insaled into the appropriate expression vector, no additional 

15 transcriptional or translational control signals may be needed. However, in cases where only coding 
sequence, or a fragment thereof, is inserted, exogenous translational control signals including an in- 
frame ATG initiation codon should be provided by the vector. Exogenous translational elements . 
and initiation codons may be of various origins, both natural and synthetic. The efficiency of 
expression may be enhanced by the inclusion of enhancers appropriate for the particular host cell 

20 system used. (See, e.g., Scharf, D. et al. (1994) Results Piobl. Cell Differ. 20: 125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding KAP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et aL (1989) 

25 Molecular Qoning. A Laboratorv Manual. Cold Spring Harbor Press, Plainview hfY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biologv. John Wiley & Sons, 
New Yodc NY, ch. 9, 13, and 16.) 

A variety of expression vectoi/host systems may be utilized to contain and express 
sequences encoding KAP. These include, but are not limited to, microorganisms such as bacteria 

30 transformed with recombinant bacteriophage plasmid, or cosmid DNA expression vectors; yeast 
transformed witii yeast esqnession vectors; insect ceU systems infected with viral e^qwession vectors 
(e.g., baculovirus); plant cell systems tcansfonned with viral expression vectors (e.g., cauliflower 
mosaic vkus, CaMV. or tobacco mosaic virus, IMV) or with bacterial expression vectors (e.g., Ti 
or pBR322 plasmids); or animal ceD systems. (See, e.g., Sambrook, supra: Ausubel, sug^ Van 

35 Heefce, G. and SM. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K et al. (1994) 
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Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; 
Takamatsu, N. (1987) BMBO J. 6:307-311; The McGiaw IBll Yearbook of Sdence and Technolo g y 
(1992) McGraw Hill, New York NY. pp. 191-196; Logan. J. and T. Shenk (1984) Rroc. Natl. Acad. 
Sci. USA 81:3655-3659; and Harrington, J J. et al. (1997) Nat CteieL 15:345-355.) Expression 
5 vectors dmved from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from various 
bacterial plasauds, may be used for delivery of nucleotide sequences to the targeted organ, tissue, or 
cell population. (See, e.g., Di Nicola. M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et 
aL (1993) Ptoc. NatL AcaA Sci. USA 90(13):6340-6344; Buller. R.M. et al. (1985) Nature 
317(6040):813-815; McGregor, DP. et al. (1994) Mol. Immunol. 31(3):219-226; and Vmna, LM. 

10 and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell en^loyed. 

In bacterial systems, a numb^ of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences ^coding KAP. For exaiiq>le, routine cloning, 
subcloning, and propagation of polynucleotide sequences ^coding KAP can be achieved using a 
multifunctional B. coli vector such as PBLUESC3RIPT (Stratagene, La Jolla CA) or PSPORTl 

15 plasmid (Life Technologies). Ligation of sequences encoding KAP into the vector's multiple 
cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be usefid 
for in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation 
of nested deletions in the cloned sequence. (See, e.g.. Van Heefce, G. and S.M. Schuster (1989) J. 

20 Biol. Chem 264:5503-5509.) When large quantities of KAP are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of KAP may be used. For example, vectors 
containing the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast egression systems may be used for production of KAP. A number of vectors 
containing constitutive or ioducible promoters, such as alpha factor, alcohol oxidase, and PGH 

25 promoters, may be used in the veast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct eitb^ the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995. supra : Bitter, GA. et al. (1987) Metiiods Bizymol. 153:516-544; and Scorer, CA. et al. 
(1994) Bio/Technology 12:181-184.) 

30 Plantsystemsmay also be used for expression of KAP. Transcription of sequences 

encoding KAP may be driv^a by viral promoters, e.g., the 358 and 19S promoters of CaMV used 
alone or m combmation with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO 
J. 6:307-311). Altematively, plant promoters such as tiie small subunit of RUBISCX) or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et 

35 al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. CeU Differ. 17:85-105.) 
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These constructs can be intnxluced into plant cells by direct DNA tiansfannation or 
palbogen-mediated transfection. (See> e.g.. The NfcGiaw Hill Yearbook of Science and Technolo|gv 
(1992) McGraw HBU, New York NY, pp. 191-196.) 

In maiTiTnalian cells, a numbCT of viral-based expression systems may be utilized. Li cases 
S where an adenovirus is used as an expression vector, sequences ^coding EAP may be ligated into 
an adenovirus transcription/translation conqplex consisting of the late promoter and tripartite leader 
sequence. Ins^on in a non-essentid El or E3 region of the vnal genome may be used to obtain 
infective virus which expresses KAP in host cells. (See, e.g., Logan, J. and T. Sheuk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous 

10 sarcoma virus (RSV) enhancer, may be used tn increase eTprftSRinn in Tnatmmliflp host fells. SV40 
or EB V-based vectors may also be used for hig^-level protein expression. 

Human artificial chromosomes (E[ACs) may also be enQ>Ioyed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional deliv^ methods (liposomes, polycationic amino 

15 polymers, or vesicles) for therapeutic pmposes. (See, e.g., Harrington, J,J. et aL (1997) Nat Genet 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable egression 
of KAP in cell lines is preferred. For exan^le, sequences encoding KAP can be transformed into 
cell lines using expression vectors which may contain viral origins of replication and/or endogenous 

20 expression elements and a selectable marker gene on the same or on a separate vector. Following 
th& introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media 
before being switched to selective media. The purpose of the selectable marker is to confer 
resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed ceUs may be 

25 propagated using tissue culture techniques ^)propriate to the cell type. 

Any nuinber of selection systems inay be used to recover transforined cell lines. These 
include, but are not limited to, the herpes siiiq)lex virus thymidine kinase and adenine 
phosphoribosyhransferase genes, for use in tk' and apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cefl 11:223-232; Lowy, L et al. (1980) Cell 22:817-823.) Also, antimetaboUte, antibiotic, 

30 OT herlncide resistance can be used as the basis for selection. For exanq>le, dhfr confers resistance 
to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and 
pat confer resistance to chlorsulfurcm and phosphinotridn acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc NatL Acad. Sci USA 77:3567-3570; Colberc-Gar^in, F. et al. (1981) 
J. Mol. BioL 150: 1-14.) Additional selectable genes have hem described, e.g., trpB and hisD, 

35 which alter ceUular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C Mulligan 
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(1988) Proc. Nad. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green 
fluorescent proteins (GPP; QontechX fi glucuronidase and its substrate fi-glucuionide, or luciferase 
and its substrate luciferin noay be used. These markers can be used not only to identify 
transformants, but also to quantify the amount of transient or stable protein expression attributable 
5 to a specific vector system. (See. e.g.. Rhodes, C.A. (1995) Methods MoL Biol. 55:121-131.) 

Although piesence/absrace of marker gene expression suggests that the gene of tnteiest 
is also present, the presence and rapiession of the gene may need to be confb:^^ Forexanq>le,if 
the sequence encoding KAP is inserted within a marker gone sequence, transformed cells containing 
sequences encoding KAP can be identified by the abs^ce of marker gene function. Alternatively, a 

1 0 marter gene can be placed in tandem with a sequence encoding KAP under the control of a single 
promoter. Bxpres^onof the marioBr gene in response to induction or selection usuaUy indicate 
expression of the tandem gene as well. 

In general, host cells that ccmtain the nucleic acid sequmce encoding KAP and that express 
KAPmybeidentifiedby a variety of procedures known to those of skill in the art These 

15 procedures include, but are not limited to, DNA-DNA or DNA^RNA hybridizations, PCR 

anq>lification, and protein bioassay or immunoassay techniques which include membrane, soluticm, 
or chip based technologies for the detection and/or quantification of nucleic acid or i»otein 
sequences. 

Immunological methods for detecting and measuring the expression of KAP using either 

20 specific polyclonal or monoclonal antibodies are known in the art Exanq>les of such techniques 
include enzyme-linked inmumosorbent assays (ELISAs), mdioimmunoassays (RIAs), and 
fluorescence activated ceU sorting ^ACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-int^ering epitopes on KAP is preferred, but a 
competitive binding assay may be enq)loyed These and other assays are weD known in the art 

25 (See, e.g., Hanq)ton, R. et al. (1990) Serological Nfethods. a Laboratorv Manual. APS Press, St Paul 
MN, Sect IV; Coligan, JJB. et al. (1997) Current Protocols in Immunology. Greene Pub. Associates 
and Wiley-hiterscience, New York NY; and Pound, J.D. (1998) Inmiunochemical Protocols, 
Humana Press, Totowa NJ.) 

A wide vari^ of labels and conjugation techniques are known by those skilled in the art 

30 and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridizati(m or PCR probes for detecting sequences related to polynucleotides encoding KAP 
include oligolabeling, nick translation, end4abeling, or PCR anq)lification using a labeled 
nucleotide. Ahematively, the sequences encoding KAP, or any fragments thereof, may be cloned 
into a vector for the production of an mRNA probe. Such vectors are known in the art, are 

35 commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
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appropriate RNA polym^e such as T7, T3, or SP6 and labeled nucleotides. These proceduies 
may be conducted udng a variety of comDoerciany available kits, such as those provided by 
AinershamPhaimacia Biotech, Promega (Madison WI^ Suitable reporter 

molecules or labels which may be used for ease of detection include radionuclides, enzymes, 
5 fluorescent, chemiluminescent, or chromog^c agents, as weU as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding KAP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracdlularly dqpendiiig on the 

10 sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which oicode KAP may be designed to contain signal sequences 
which direct secretion of KAP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host ceU strain may be chosm for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 

15 the polypeptide include, but are not linnted to, acetylation, caiboxylation, glycosylation, 

phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a '*prepro" 
or '"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WD8) are available fit>m the 

20 American Type Culture Collection (ATCC, Manassas VA) and may be chosra to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding KAP may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For exanq)le, a chimeric KAP protein 

25 containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the scre^iing of peptide libraries for inhibitors of KAP activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using conomercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Tnt), calmodulin binding peptide (CBP), 6-ffis, 

30 FLAG, c-myc, and hemagglutrnm (HA). GST, MBP, Trx, CBP, and 6-His enable purification of 
their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, 
and n^tal-chelate resins, respectively. FLAG, c-myc, and hemagglutimn (HA) enable 
immunoaffinity purification of fusion proteins using commercially available monoclonal and 
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 

35 engineered to contain a proteolytic cleavagp site located between the KAP encoding sequence and 
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the hetoologous protein sequence, so that KAP may be cleaved away from the hetmlogous moiety 
following purification. Methods for fusion protein e^qnession and purification are discussed in 
Ausubel (1995, supra, ch. 10). A variety of commercially available kits may also be used to 
facilitate e?q>ression and purification of fusion proteins. 
5 Id a furdier ^zibodin^t of the inventioo, synthesis of radiolabeled KAP may be achieved Jn 

vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Etomega). These 
systems couple transcription and translation of protein-coding sequences operably associated with 
the T7, T3, or SP6 promoters. Translation takes place in tihe presence of a radiolabeled amino acid 
precursor, for example, ^S-methionine. 

10 KAP of the present invention or fragments thereof may be used to screen for conq>ounds 

tibat specifically bind to KAP. At least one and iq) to a pluraKty of test coiioq[>ounds may be screened 
for specific binding to KAP. Examples of test conqpounds include antibodies, oligonucleotides, 
proteins (e.g., receptors), or small molecules. 

In one eoabodiment, the compound thus id^lified is closely related to the natural ligand of 

IS KAP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See. e.g., Coligan, J.R et aL (1991) Current Protocols in Timn iinnlnp;Y 
l(2):_Chapter S.) Similariy, the conqpound can be closely related to the natural receptor to which 
KAP binds, or to at least a fiiagment of the reenter, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 

20 these conqpounds mvolves producing appn^ate cells which express KAP, either as a secceted 
protein or on the cell membrane. Preferred cells include cells from mamtnalje, yeast, T>mRfiplii1a^ or 
R coli . Cells ejqnessing KAP or ceU membrane fractions which contain KAP are then contacted 
with a test con^und and binding, stimulation, or inhibition of activity of either KAP or the 
compound is analyzed. 

25 An assay may simply test bindmg of a test conq[)ound to the polypeptide, wherein trinding is 

detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For exanople, 
the assay may conqnise the steps of combining at least one test compound with KAP, eidier in 
solution or a£5xed to a solid support, and detecting tiie binding of KAP to the con^und. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 

30 labeled competitor. Additionally, the assay may be carried out using ceD-free preparations, 

chemical libraries, or natural product mixtures, and the test compound(s) may be fiiee in solution or 
affixed to a solid support 

KAP of the present invention or fragments thereof may be used to screen for conq>ounds 
that modulate the activity of KAP. Such conxpounds may include agonists, antagonists, or partial or 

35 inverse agonists. In one embodiment, an assay is performed under conditions permissive for KAP 
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activity, wherein KAP is combined with at least one test compound, and the activity of KAP in the 
presence of a test compound is conq>aied with the activity of KAP in the absence of the test 
con^uid. A change in the activity of ElAP in the presence of the test compound is indicative of a 
con^und that modulates the activity of KAP. Alternatively, a test compound is combined with an 
5 in vitro or cell-fiee system comprismg KAP under conditions suitable for KAP activity, and the 
assay is performed. Ineith^of these assays, a test coinpoundwMch modulates the activity of KAP 
may do so indirectiy and need not come in direct contact with the test conq>ound. At least one and 
up to a plurality of test con^unds may be screened. 

Jn another embodiment, polynucleotides encoding KLAP or their TnaTnTnalia^ homologs may 

10 be '^knocked ouf * in an animal model system using homologous lecondnnation in embryonic stem 
(ES)cells. Such techniques are weUImown in tiie art and are useful for the generation of anim 
models of human disease. (See. e.g., U.S. Patmt No. 5,175383 and U.S. Patent No. 5.767,337.) 
For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 

15 interest disrupted by a marker g^e, e.g., the neomycin phosphotransferase g^e (neo; Capecchi, 
MR. (1989) Science 244:1288-1292). The vector integrates into tiiecorrespondmg region of the 
host genozue by homologous recomWnarion. Alternatively, homologous recombination takes place 
using the Cte-loxP system to knockout a gene of interest in a tissue- or developmentai stage-specific 
manner (Marth, J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 

20 Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

blastocysts such as those from the C57BLy6 mouse strain. The blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transg^c animals thus generated may be tested with 
potential therapeutic or toxic agents. 

25 Polynucleotides encoding KAP may also be manipulated in vitro in ES cells derived fix)m 

human blastocysts. Human ES cells have the potential to difTere^tiate into at least ei^A sq>arate 
cell lineages including endodenn, mesoderm, and ectodermal cell types. These cell lineages 
differentiate into, for exanq)le, neural cells, hematopoietic lineages, and cardion:^ocytes (Thomson, 
J.A. et al. (1998) Science 282:1145-1147). 

30 Polynucleotides ^coding KAP can also be used to create 'Tcnocldn" humanized animals 

(pigs) or transgenic animals (mice or mts) to model human disease. With knockin technology, a 
region of a polynucleotide encoding KAP is injected into animal ES cells, and the injected sequence 
integrates into the animal cell graome. Transformed cells are injected into blastulae, and the 
blastulae are in:q)lanted as described above. Transgenic progeny or inbred lines are studied and 

35 treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
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Alternatively, a manmal inbied to oveiexpress KAP, e.g.» by secreting KAP in its mil]^ may also 
serve as a convenient source of that protein (Jaime, J. et al. (1998) Biotecfanol. Annu. Rev. 4:SS-74). 
TBQERAPEUnCS 

Chemical and structural similarity, e.g., in the context of sequences and moti&, exists 
5 between regions of KAP and kinases and phosphatases. In addition, exanq)les of tissues expressing 
KAP can he found in Table 6. Therefore,KAP^q)ears to play a role in cardiovascular diseases, 
immune system disorders, neurological disorders, disorders affecting growth and development, lipid 
disorders, cell proliferative disorders, and cancers. In the treatment of disorders associated with 
increased KAP e;ipression or activity, it is desirable to decrease the expression or activity of KAP. 

10 In the treatment of disorders associated with decreased KAP expression or activity, it is desirable to 
mcrease the expression or activity of KAP. 

Therefore, in one embodiment, KAP or a firagn^nt or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of KAP. Examples of such disorders include, but are not limited to, a cardiovascular 

IS disorder such as artmovCTOus fistula, atherosclerosis, hypertension, vasculitis, Raynaud^ disease, 
aneurysms, arterial dissections, varicose vdns, thrombophlebitis and phlebothrombosis, vascular 
tumors,'and conqilications of thrombolysis, balloon angioplasty, vascular rq)lacement, and coronary 
artery bypass graft: surgery, congestive heart failure, ischemic heart disease, an^ma pectoris, 
myocardial infarction, hypertensive heart disease, degenerative valvular heart disease, calcific aortic 

20 valve straosis, cong^tally Incuspid ac»tic valve, mitral annular calcification, imtral valve prolapse, 
rheumatic fevar and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic 
endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, 
cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and 
conq>lications of cardiac transplantation, coi^nital lung anomalies, atelectasis, puhnonary 

25 congestion and edema, pulmonary embolism, pulmonary hemonbage, pulmonary infarction, 
pulmonary hypertension, vascular sclerosis, obstructive pulmonary disease, restrictive pulmonary 
disease, chronic obstructive puhnonary disease, enqphysema, chronic bronchitis, bronchial asthma, 
bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, lung abscess, pulmonary 
tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, idiopathic pulmonary 

30 fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, pulmonary 
eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary hemorrhage 
syndromes, Goo^asture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary involvement 
in collagen-vascular disorders, pulmonary alveolar proteinosis, limg tumors, inflammatory and 
noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced lung disease, 

3S radiation-induced lung disease, and complications of lung transplantation; an immune dis(»der such 
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as acquired umnunodeficimcy syn^me (AIDS), Addison's disease, adult lespiratoiy distress 
syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
candidiasis-ectod^xoal dystrophy (AFECED), bronchitis, cholecystitis, contact d^madtis, Crohn's 

5 disease, atqpic dermatitis, dermatomyositis, diabetes mellitus, enqphysema, episodic lymphopenia 
with lynq)hocytotoxins, aythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout. Graves* disease, Haslmnoto's thyroiditis, 
hypereosinophilia, irritable bowel syndronoe, multiple sclerosis, myasAenia gravis, myocardial car 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter*s 

10 syndrome, riieumatoid arthritis, scleroderma, Sjogren's syndrome, systemic am^hylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndronoe, conqilicaticms of cancer, hmodialysis, and extracorporeal circulation, viral, 
bacterial, fimgal, parasitic, protozoal, and helminthic infections, and trauma; a neurological disorder 
such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 

15 disease^ Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
extrapyramidal disordos, amyotrophic lateral sclerosis and other motor neuron disorders, 
progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, nniltiple sclerosis and 
other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, 
epidural abscess, suppurative intracranial thrombophlebitis, niyelitis and radiculitis, viral central • 

20 nervous system disease, pnon diseases including koru, Creutzfeldt-Jakob disease, and Gerstmann* 
Straussler-Scheinker syndrome, fiatal familial insomnia, nutritional and metabolic diseases of the 
nmous system, neurofibromatosis, tubmus sclerosis, cerebeUoietmal heman^oblastomatods, 
encephalotrigeminal syndrome, meatal retardation and other develc^mental disorders of the cratral 
nervous system including Down syndrome, cCTebral palsy, neuroskeletal disorders, autonomic 

25 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and 
otii^ neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, inherited, metabolic, endocrine, and toxic myopatiiies, myasthenia gravis, periodic 
paralysis, mental disordm including mood, anxiety, and schizophrenic disorders, seasonal affective 
disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, 

30 paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, 
corticobasal degeneration, and familial &ontoteiiq)oral dementia; a growth and developmental 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, 
mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, renal tubular acidosis, anemia, Cushing's 

35 syndrome, achondroplastic dwarfism, Ducheime and Beck^ muscular dystrophy, epilepsy, gonadal 



55 



wo 02/46384 



PCTAUSOl/47431 



dysgenesis, WAGR syndrome CWIlms' tumor, aniridia, g^tonrinaiy abnormaUties, and mratal 
retaidationX Smith-Mag^iis syndrome, myelodysplastic syndrome, hereditaiy mucoepithelial 
dysplasia, hereditary keratodeimas, hereditary neuropadiies such as Charcot-Marie-Tooth disease 
and neurolBbromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's 
5 chorea and cerebral palsy, spina bifida, anencq>haly, craniorachischisis, congenital glaucoma, 
cataract, and s^orineural hearing loss; a lipid disorder such as fatty liver, cholestasis, primary 
biliary cirrhosis, carnitine deficiency, carnitine pahnitoyltransfeiase deficiency, myoadenylate 
deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabiy's disease, Gauch^'s 
disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, GM, 

10 gangliosidosis, and ceroid lipofuscinosis, abetalqKiproteinemia, Tangier disease, 
hyperiipoproteinemia, diabetes mellitus, Hpodystrophy, lipomatoses, acute panniculitis, 
disseminated fat necrosis, adiposis dolorosa, l^id adrenal hyperplasia, fnininyi^l change disease, 
lipomas, atherosclerosis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, 
primary hypoalphalipoprotdnemia, hypothyroidism, renal disease, livor disease^ lecithin:cholesten>l 

15 acyltransferase deficiency, cerebrotendinous xanAomatosis, sitosterolemia, hypocholesterolemia, - 
Tay-Sachs disease, Sandhoffs disease, hyperlipidemia, hyperlipemia, lipid myopathies, and 
obesity; and a ceU proliferative disorder such as actinic keratosis, arterioscleros^^ . 
bursitis, cinhosis, hqpatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal 
nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary tfarombocythemia, and cancers 

20 including adenocarcinoma, leukoma, lynq>homa, melancmia, myeloma, sarcoma, teratocarcinama, 
and, in particular, cancers of the adrenal gland; bladder, bone, bone marrow, brain, breast, cervix, 
gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleoi, testis, diymus, thyroid, uterus, leukmnias 
such as multiple myeloma, and lymphomas such as Hodgkin's disease.. 

25 In anoth^ embodiment, a vector capable of expressing KAP or a firagment or derivative 

thmof may be administered to a subject to treat or prevrat a disorder associated with decreased 
expression or activity of KAP including, but not Imdted to, those described above. 

In afurther embodimeait, a compositim comfnising a substantially purified SLAP in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 

30 prevent a disorder associated with decreased esqxression or activity of KAP mcluding, but not 
limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of KAP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of KAP including, but not limited to, those listed above. 

35 In a further embodiment, an antagoiust of KAP may be administaered to a subject to treat or 
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prevent a disorder associated with iacreased expression or activity of KAP. Exan^les of such 
disorders include, but are not limited to, those cardiovascular diseases, immune system disorders, 
neiuDlogical disorders, disorders affecting growth and development, lipid disorders, cell 
proliferative discHtdm, and cancers described above, hi one aspect, an antibody which specifically 
S bmds KAP may be used directly as an antagonist or indirectly as a taigetmg or delivery mechanism 
for bringmg a pharmaceutical agent to cells or tissues which express KAP. 

Iq an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding KAP may be administered to a subject to treat or prevent a disorder associated wifli 
mcreased e^qnession or activity of KAP includmg, but not lumted to, those desoibed above. 

10 hi other embodiments, any of the proteins, antagonists, antibodies, agonists, coxq>leinentary 

sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate ag^ts for use in combination therapy may be 
by one of ordinary skill in the art, according to conventi(»ial jdiannaceutical principles. The 
combination of therapeutic agents may act synergistically to effect die treatment or prevention of the 

15 various disorders described above. Using this ap]m>ach, one may be able to achieve therapeutic 
efiScacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of KAP may be produced using methods which are generally known in the 
art hi particular, puriiSed KAP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind KAP. Antibodies to KAP may also 

20 be generated using methods that are well known in the art Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fi-agments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i,e., those which inhibit 
dimer formation) are generally preferred for th^:apeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 

25 and others may be immunized by injection with KAP or with any firagn^nt or oligopeptide thereof 
which has imnnmogenic properties. Dq)ending on die host species, various adjuvants may be used 
to increase immunological response. Such adjuvants mclude, but are not limited to, Reund's, 
mineral ^Is such as aluminum hydroxide, and surface active substances such as lysolecithin, 
pluronic polyoLs, polyanions, peptides, oil emulsions, KLH, and dinitrophenoL Among adjuvants 

30 used in humans, BCG (bacilli Cahnette-Guerin) and Corvnebac ^ftrinnn, parvum niY* ftgjwnaiiy 
preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
KAP have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
35 firagments are identical to a portion of the amino acid sequence of the natural protein. Short 
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stretches of KAP ammo adds may be fused with those of another protem, such as KLH, and 

antibodies to the chimeiic molecule may be produced. 

Monoclonal antibodies to KAP may be prepared using any technique which provides for the 

production of antibody molecules by continuous cell lines in culture. These include, but are not 
5 limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EB V- 

hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 

(1985) J. Immunol. Methods 81:3142; Cote, RJ. et al. (1983) Proc. Natl. Acad. ScL USA 

80:2026-2030; and Cole, SP. et al. (1984) Mol. Cell Biol. 62: 109-120.) 

In addition, techniques developed for the prodwtion of "cfaimmc antibodies," such as the 
10 splicing of mouse antibocfy genes to human antibody genes to obtain a noolecule with appropriate 

autig^ specificity and biological activity, can be used. (See, e.g., Morrison, SX. et al. (1984) Proc. 

Natl. Acad. Sci. USA 81:6851-«855; Neubager, M.S. et aL (1984) Nature 312:604-608; and 

Takeda. S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the 

production of single chain antibodies may be ad^ted, usfaig methods known in the art, to produce 
15 EAP-specific single chain antibodies. Antibodies with related specificity, but of distmct idiotypic 

conq>osition, may be generated by chain shufiDing from random combinatorial immunoglobulin 

libraries. (See. e.g.. Burton, D.R. (1991) Proc. Natl. Acad. ScL USA 88:10134-10137.) 

• Antibodies may also be produced by inducing in vivo production in the lynq>hocyte ' 

population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
20 disclosed in the literature. (See, e.g., Orlandi,.R. et al. (1989) Proc. Natl. Acad. Sci. USA 

86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fiagments which contain specific binding sites for KAP may also be generated. 

For example, such fi:agmeats include, but are not limited to, F(ab')2 fiagments produced by pepsin 

digestion of the antibody molecule and Fab firagn^nts generated by reducing the disulfide bridges of 
25 the F(ab*)2 fragnmts. Alternatively, Fab expression libraries may be constructed to allow rapid and 

easy identification of monoclonal Fab firagments with the desked specificity. (See, e.g., Huse, WJ>. 

et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 

specificity. Numerous protocols for conq)etitive binding or immunoradiometric assays using either 
30 polyclonal or monocl(mal antibodies with established specificities are weU known in the art Such 

immunoassays typically involve the n^asurement of conq>lex formation between KAP and its 

specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 

reactive to two non-interfering KAP epitopes is generally used, but a conq)etitive binding assay may 

also be employed (Pound, supraV 
35 Various methods such as Scalchard analysis in conjunction with radioimmunoassay 
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techniques may be used to assess Oie afEinity of antibodies for KAP. A£Bnity is expressed as an 
association constant, K., which is defined as the molar concentration of KAP-antibody complex 
divided by die molar concmtrations of firee antigen and firee antibody under equilibrium conditions. 
The deteimined for a preparation of pdyclcmal antibodies, which are heterogeneous in their 
5 affinities for multiple ICAP epitopes, represents the average afBnity, or avidity, of the antibodies for 
KAP. The Kb determined for a preparation of monoclonal antibodies, which are monospecific for a 
particular KAP epitope, represents a true measure of affinity. High-affinity antibody preparations 
with Ka ranging from about lO' to 10^^ IJmole are prefened for use in immunoassays in which the 
KAP-antibody conqilex must withstand rigorous manq>uIations. Low-affinity antibody preparations 
10 with K. ranging from about 10^ to 10^ L/mole are preferred for use in immunopurification and 
shnilar procedures which ultimatdy require dissociation of KAP, prefinably in active form, fiom the 
antibody (Catty, D. (1988) Antibodies, V nlnmft T> A Pi actical Approach, IRL Press, Washington 
DC; Liddell, J.R and A. Oyer (1991) A Practical Guide to Monoclonal Antibodies. John Wiley & 
Sons, New York NY). 

15 The titer and avidity of polyclonal antibody preparations may be further evaluated to 

determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally enq)loyed in procedures reqmring 
precipitation of KAP-antibody conq>lexes. Ptocedures for evaluating antibody specificity, titer, and 

20 avidity, and guidelines for antibody quality and usage in various ^plications, are generally 
available. (See, e.g.. Catty, supra , and Coligan et al. supra .) 

Jn another embodiment of the invention, the polynucleotides ^coding EIAP, or any 
fragment or con^>lenient thereof, n^y be used for therapeutic purposes. In one aspect, 
modifications of gene expression can be achieved by designing conq>lementary sequraces or 

25 antisense molecules (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulator 
regions of the gene encoding KAP. Such technology is well known in the art, and antisense 
oligonucleotides or larger fragments can be designed from various locations along the coding or 
control regions of sequences encoding KAP. (See, e.g., Agrawal S., ed. (1996) Antisense 
Therapeutics. Humana Press Inc., Totawa NJ.) 

30 In thaapeutic use, any gaie delivery system suitable for introduction of the antisense 

sequences mto appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, iq>on transcription, produces a sequence 
complementary to at least a pcntion of the cellular sequaice encoding the target protem. (See, e.g.. 
Slater, J.R et aL (1998) L AUergy Clin. Lnmunol. 102(3):469-475; and Scanlon, KJ. et aL (1995) 

35 9(13): 1288-1296.) Antisense sequences can also be introduced intraceUularly through the use of 
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viral vectors, such as retrovirus and adeno-assodated vims vectors. (See, e.g., Mill^, AJ>. (1990) 
Blood 76:271; Ausubel, supra: Uckat. W. and W. Walther (1994) PhaimacoL Ther. 63(3):323- 
347.) Other gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, 
and other systems known in the art (See, e.g., Rossi, J. J. (1995) Br. Med. Bull. S1(1):217-22S; 

5 Boado, RJ. et al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Monis. M.C et al. (1997) Nucleic 
Acids Res. 25(14):2730-2736.) 

In another embodimBnt of the invention, polynucleotides encoding KAP may be used for 
somatic or germline gene therapy. Gene therapy may be p^ormed to (i) correct a g^etic 
deficiency (e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease 

10 characterized by X-linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-^72), 
severe combined immunodeficiency syndrome associated with an inherited adenosine deaminase 
(ADA) deficiency (Blaese. R.M. et al. (1995) Science 270:475-480; Bordignon, C et al. (1995) 
Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. 
(1995) Hum. Gcaie Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), 

15 tfialassamias, familial hypercholesterolemia, and hemophilia resulting bom Factor Vm or Factor IX 
deficiencies (Crystal, R.G. (1995) Science 270:404^10; Vomoa, IM. and N. Somia (1997) Nature 
389:239-242)), (ii) express a conditionally lethal g^ product (e.g., in the case of cancers which 
result from unregulated cell proliferation), or (iii) express a protein which affords protection against 
intracellular parasites (e.g., against human retroviruses, such as human immunodeficiency virus 

20 (HIV) CBaltimore, D. (1988) Nature 335:395-396; Poeschla, R et al. (1996) Proc. NatL Acad. Sci. 
USA 93: 1 1395-1 1399), hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida 
albicans and Paracoccidioides brasiliensis: and protozoan parasites such as Plasmodium felcip anitn 
and Trypanosoma cru2i\ In the case where a genetic deficiency in BIAP e^qxression or regulation 
causes disease, the expression of KAP from an appropriate populaticm of transduced cells may 

25 alleviate the clinical manifestations caused by the genetic deficiency. 

In a finther embodim^t of the invention, diseases or disorders caused by deficiencies in 
KAP are treated by constructing mammalian expression vectors encoding KAP and introducing 
tfiese vectors by mechanical noeans into KAP-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 

30 ballistic gold particle delivery, (iii) lqK)SomeHmediated transfection, (iv) receptor-mediated gene 
transier, and (v) tte use of DNA transposons (Morgan, R.A. and WJF. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies, Z (1997) CeU 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
ppm. BiotechnoL 9:445-450). 

Bxpression vectors tliat may be effective for the expression of KAP include, but are not 

35 limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
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Gnvitrogen. Carlsbad CA). PCMV-SOUPT. PCMV-TAG, PEGSHZPERV (Stratagene, La JoUa 
CA), and PTBT-OHP, PTBT-ON. FraE2, PTRB2.LUC, PTK-HYG (Clontech, Palo Alto CA). 
KAP may be expressed using (i) a constituthrely active promoter, (e.g., from cytomegalovirus 
(CMV), Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an 
5 inducible promoter (e.g., the t£tracyclin&-regulated promoter (Gossen, M. and H. Bujard (1992) 
Proc. Natl. Acad. Sci. USA 89:5547-5551; Gossen, M et al. (1995) Science 268:1766-1769; Rossi. 
FJMV. and HJM Blau (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available m the 
T-REX plasmid (Livitrogen)); tte ecdysone-inducible promote (available in the plasmids PVGRXR 
and PIND; lavitrogen); the FK506^pamycin inducible promoter; or the RU486/mifepristone 

10 mducible promoter (Rossi, FJMV. and KM. Blau, supra) \ or (iii) a tissue-specific promoter or the 
native promoter of the endogenous gene ^coding KAP from a normal individual. 

(Commercially available liposome transformation Idts (e.g., the PEBFECT LIPID 
TRANSFBCnON KIT, available from Livitrogra) aUow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require mmimal effort to optimize experimental 

15 parameters. M the alt»iiative,transfonnation is perfocmed using thecal 

(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, R et al.. 
(1982) EMBO J. 1:841-845). The introduction of QNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

in another embodiment of the invention, diseases or disorders caused by genetic defects 

20 with respect to KAP expression are treated by constructmg a retrovirus vector consisting of (i) the 
polynucleotide encoding KAP under the control of an independent promote or the retrovirus long 
termmal repeat (LTR) promoter, (ii) appropriate RNA packagmg signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cw-acting RNA sequences and coding sequ^ices 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 

25 commercially avaUable (Stratagene) and are based on published data (Riviere, L et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by referwice herein. The vector is propagated m 
an appropriate vector producing cell line (VP(X) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Arm^tano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, MA. et al. (1987) J. Virol. 61:1639-1646; Adam, MA. and 

30 AD. MiUer (1988) J. VudL 62:3802-3806; DuU, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey. 
R. et al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg CMefliod for obtaining 
retrovirus packaging cell lines producing hi^ transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T- 

35 cells), and the return of transduced cells to a patient are procedures weU known to persons skilled in 
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die art of gene therapy and have been well documrated (Ranga, U. et al. (1997) J. Virol. 71:7020- 
7029; Bauear, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, ML. (1997) J. Virol. 71:4707-4716; 
Ranga. U. et al. (1998) Proc. Natl. Acad. ScL USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

5 Id the alternative, an adenovinis-based gene dimpy delivery syst^ is used to deliv^ 

polynucleotides ^icoding KAP to cells which have one or more ^etic abnormalities with respect 
to the e^qnression of KAP. The construction and packaging of adraovirus-based vectors are well 
' known to those with ordinary skill in the art Replication defective adenovirus vectors have proven 
to be versatile for importing g^es ^ocoding imoDOLunor^^ulatQry proteins into intact islets in the 

10 pancreas (Csete, MB. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral 
vectors are described in U.S. Patent No. 5»707,618 to Armentano ("Adenovirus vectors for grae 
thei^y"), hereby incorporated by reference. For adraoviral vectors, see also Antinozzi, PA. et al. 
(1999) Annu. Rev. Nutr. 19:511-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, 
both incorporated by reference herein. 

15 In another alternative, a heipes^sased, gene therapy delivery system is used to deliv^ 

polynucleotides ^coding KAP to target ceUswUch have one or more genetic atmorma^ 
req)ect to the expression of KAP. The use of herpes simplex virus (HSV>based vectors may be 
especially valuable for mtrodudng KAP to cells of the central nervous system, for which HS V has a 
tropism. The construction and packagiog of herpes4>ased vectors are well known to those with 

20 ordinary skill in the art A replication-conq>etent herpes simplex virus (HSV) type 1-based vector 
has been used to deliver a reporter gene to the eyes of primates (liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent No. 5,804,413 to DeLuca CBerpes simplex virus strains for gene transfer^, which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 

25 which consists of a genome containing at least od& exogenous gene to be transferred to a cell under 
the control of the ajypropriate promc^r for purposes including human gene therapy. Also taught by 
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and 
ICP22. For HSV vectors, see also Coins, WJ?. et al. (1999) J. Virol. 73:519-532 and Xu, R et al. 
(1994) Dev. Biol. 163:152-161, hereby rncorporated by reference. The manipulation of cloned 

30 herpesvirus sequraces, the generation of recombinant virus following the transfection of multiple 
plasmids containing difTeient segmrats of the large herpesvirus genomes, the growth and 
propagation of l^pesvirus, and the infection of cells with herpesvirus are techniques well known to 
those of ordinary skill in the ait 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 

35 deliver polynucleotides ^coding KAP to target cells. The biology of the prototypic alphavirus. 
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Sendiki Foiest ATirus (SFV), has been studied extensively and gene transfer vectors have heea based 
on the SFV genome (Garoff. H. and K,-J. U (1998) Cmr. 0pm. Biotechnol. 9:464469), During 
alphavinis RNA lepKcation, a subgenomic RNA is g^erated that nonnally encodes the viral capsid 

• proteins. This subgonomic RNA iqpUcates to U^ber levels than the fu^ 
5 resulting in the overproduction of capsid protdns relative to the viial proteins with enzymatic 
activity (e.g., protease and polymerase). Similarly, inserting the coding sequence for KAP into the 
alphavinis genome in place of the capsid-coding region results in the production of a large number 
of EAP-coding RNAs and die synthesis of high levels of KAP m vector transduced cells. While 
alphavinis infection is typically associated with cell lysis within a few days, the ability to establish a 

10 persistmt infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic reph cation of alphaviruses can be altered to suit tibe needs of the gene therapy 
plication Q>ryga« S A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses 
will allow the introduction of KAP into a variety of cell types. The specific transduction of a subset 
of cells in a pcq)iiIationii]ay require the sorting of cells prior to transdu^ Themethodsof 

IS manipulatmg infectious cDNA clones of alphaviruses, performing alphavinis cDNA and RNA 
transfections, and performing alphavinis infections, are well known to those with ordinary skill in 
the art 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 firom the start site, may also be eiiq>loyed to inhibit gene expression. Similarly, 

20 inhibition can be achieved using triple heUx base-pairing methodology. Triple helix pairing is 
useful because it causes inhibition of the ability of the double helix to open sufficiently for the 
binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances- 
using triplex DNA have been described in the litaature. (See, e.g.. Gee, JJB. et al. (1994) in Huber, 
BJB. and BX Cair, Molecular and Tinmi mologic ADproacbes. Futura Publishing, Mt Kisco NY, pp. 

25 163-177.) A con^lementary sequ^ce or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript firom binding to ribosomes. 

Ribozyn^ enzymatic RNA molecules, may also be used to catalyze the specific cleavage 
of RNA Tte mechanism of ribozyme action involves sequence-speci&c hybridization of the 
ribozyme molecule to conqplementary target RNA, followed by endonucleolytic cleavage. For 

30 exan^le, engineered hammertiead modf ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleava^ of sequraces encoding KAP. 

Spedfic ribozyme cleavage sites within any potential RNA target are initially identified by 
scanniog the target molecule for ribozyme cleavage sites, including the foUowing sequences: GUA, 
GUU, and GUC Once id^tified, short RNA sequences of between 15 and 20 ribonucleotides, 

35 corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
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seccmdary stmcmn^ fealiiies which nmy lendCT tte The sidtability of 

candidate taigets may also be evaluated by testing accessibOity to hybridization with 
conq>lementaTy oligonucleotides using ribonuclease protection assays. 

Conqplementaiy ribonucleic acid molecules and ribozymes of the invention may be prepared 
5 by any method known in the art for the synthesis of nucleic add molecules. These include 
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite 
chemical synthesis. Alternatively, RNA molecules may be graerated by in vitro and in vivo 
transcription of DNA sequ^ces encoding KAP. Such DNA sequraces may be incorporated into a 
wide variety of vectors with suitable RNA polymerase promotm such as T7 or SP6. Alternatively, 

10 these cDNA constructs that synthesize con^lementary RNA, constitutively en* inducibly, can be 
introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the S' and/or 3* 
ends of the molecule, or the use of phosphorothioate or 2' 0-methyl rather than phosphodiesterase 

15 linkages within the backbone of the molecule. This conc^t is inhmnt in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

20 An additional embodiment of the invention encompasses a method for screening for a 

compound which is eflEective in altmng expression of a polynucleotide encoding KAP. Compounds 
which may be effective in altering e:q»ression of a specific polynucleotide may include, but axe not 
limited to, oligonucleotides, antisense oligonucleotides, trq)le helix-fomiing oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 

25 chemical entities which are enable of interactmg with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide egression by acting as either inhibitors or promoters of 
polynucleotide egression. Thus, in die treatment of disorders associated with increased KAP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding KAP may be therapeutically useful, and in the treatment of disorders associated with 

30 decreased KAP e^xression or activity, a conq>ound which specifically promotes expression of the 
polynucleotide ^coding KAP may be therapeutically useful 

At least one, and up to a plurality, of test compounds may be screened for efTectiveness in 
altering expression of a specific polynucleotide. A test con^und may be obtained by any method 
commonly known in the art, including chemical modification of a con^und known to be effective 

35 in altering polynucleotide expression; selection fiom an existing, commercially-available or 
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pioprietaiy library of natmally-occuiriDg or non-Datura] chemical compounds; rational design of a 
conq)Ound based on chemical and/or structoral properties of the target polynucleotide; and selection 
firom a library of chemical compounds created combinatorially or randomly. A sanq>le conqnising 
a polynucleotide encoding KAP is exposed to at least one test compound thus obtained. The sample 
5 may conqirise, for example, an intact or pcrmeabilized ceU, or an in vitro cell-fiee or reconstituted 
biochemical systentL Alterations in the e3q«ession of a polynucleotide ^coding KAP are assayed 
by any method commonly known in the art Typically, the e3q>res^on of a specific nucleotide is 
detected by hybridizaticm with a probe having a nucleotide sequence complementary to tire 
sequ^ce of the polynucleotide encoding KAP. The amount of hybridization may be quantijBed, 

10 thus forming the basis for a conq)arison of the e^quession of the polynucleotide both with and 
without exposure to one or more test compounds. Detectionof a changp in the expression of a 
polynucleotide exposed to a test compound indicates that the test compound is e^ctive in altering 
the esi»ession of the polynucleotide. A screen for a coiiq>ound efifective in altering expression of a 
specific polynucleotide can be carried out, for example, using a fehiyrvear charomvces pombe gme 

15 expression systrai (Atkms, D. et al. (1999) U.S. Patent No. 5,932,435; Amdt. GM. et al. (2000) 
Nucleic Acids Res. 28:E1S) or a human cell Ime such as HeLa cell (Clarke, NLL. et aL (2000) 
BiochmL Biophys. Res. Commun. 268:8-13). A particular embodiment of the present invention 
involves screening a combinatorial library of oligonucleotides (sudi as deoxyribonucleotides, 
ribonucleotides, peptide nucleic adds, and modified oligonucleotides) for antis^ise activity against 

20 a specific polynucleotide sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, 
T.W. et al. (2000) U.S. Patent No. 6,022,691). 

Many methods for introducing vectors into ceUs or tissues are available and equally suitable 
for use m vivo, in vitro, and ex vivo. For j^jrfvo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patioiit 

25 Delivery by transfection, by liposome injections, or by polycationic amino polymers may be 
achieved using metiiods which are well known in the art (See, e.g.. Goldman. CK et aL (1997) 
Nat. Biotechnol. 15:462-466.) 

Any of the ther^utic n^thods described above may be applied to any subject in need of 
such therapy, including, for exaiiQ)Ie, mammals such as humans, dogs, cats> cows, hoses, rabbits, 

30 and monkeys. 

An additional embodiment of the invention relates to the administration of a conoposition 
which generally conqmses an active ingredient formulated with a phannaceutically acceptable 
excipient Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
35 RftiniTiptnn*s Pharmac eutical Sciences (Maack Publishing, Easton PA). Such coirq>ositions noay 
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consist of KAP, antibodies to KAP, and mimetics, agonisls, antagonists, or inhibitois of KAP. 

The compositions utilized in tbis invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intta-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intrapOTtoneal. intranasal, 
5 enteral, topical, sublingual, or rectal means. 

Con?>ositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known m the art In the case of macromolecules (e.g. larg^ 
10 peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 
region of the lung have enabled tiie practical delivery of drugs such as insulin to blood diculatian 
(see, e.g., Patton, I.S. et al., U.S. Patent No. 5,997,848). Puhnonary deUvery has the advantage of 
administration without needle mjection, and obviates the need for potentially toxic penetration 
enhancers. 

15 Conqwsitions suitable for use in the invention include compositions wherein the active 

ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of fliose skilled in the art 

Specialized forms of conqwsitions may be prepared for direct intracellular delivery of 
macromolecules conq)rising KAP or fi:agmMits thereof For exanoqple, liposome preparations 

20 containing a cell-iBopermeable macromolecule may promote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, KAP or a fragment tiiereof may be joined to a short cationic N- 
terminal pOTtion from die HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, SiL 
et al. (1999) Science 285:1569-1572). 

25 For any co^^)ound, the tiierapeutically effective dose can be estinaated initially either in cell 

culture assays, e.g., of neoplastic cells, or in animal noodels such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of admhustration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

30 A theiapeuticaQy effective dose refers to tiiat amount of active ingredient, for example KAP 

or fragments thereof, antibodies of KAP, and agonists, antagonists or inhibitors of KAP, which 
ameliorales die syrrq)toms or condition. Thers^utic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in ceD cultures or witii e;q)erimental animals, such as by 
calculating die ED50 (die dose tiierapeutically effective in 50% of tiie population) or LD^ (die dose 

35 letiial to 50% of die population) statistics. The dose ratio of toxic to dierapeutic effects is die 
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therapeudc index, which can be expressed as the LD51/ED50 ratio. Conq>ositi(»is which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such conipositions is 
preferably within a range of drculaling concentrations that includes the ED50 with little or no 
5 toxicity. The dosage varies within this range depending \spon the dosage form employed, the 
sensitivity of the patient, and the route of administratioh. 

The exact dosage wiU be det^mined by the practiticmer, in light of &ctors related to the 
subject requiring treatm^t Dosage and administration aie adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect Factors which may be taken hito account include 

10 the severity of the disease state, the general health of the subj ect, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to ther^y. Ix)ng^actirigconq>ositi(msiiiaybeadministemiev^3 to4days,every wee^ 
or biweekly depending on the half-life and clearance rate of the particular formulatiQn. 

Normal dosage amounts may vary fiom about 0. 1 /ig to 100,000 Aig, iq> to a total dose of 

15 about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art 
Those skilled in the art will en^loy different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides wiD be specific to particular cells, 
conditions, locations, etc. 

20 DIAGNOSTICS 

In another embodiment, antibodies which specrBcally bind KAP may be used for the 
diagnosis of disorders characterized by expression of KAP, or in assays to monitor patients being 
treated with KAP or agonists, antagcousts, or inhibitors of KAP. Antibodies useful for diagnostic 
purposes may be prq>arBd in the same manner as described above for therapeutics. Diagnostic 
25 assays for KAP mclude methods which utilize the antibody and a label to detect KAP in human 
body fluids or in extracts of cells or tissues. The antibodies may be used with or without 
modification, and may be labeled by covalent or non-K:ovalent attachment of a reporter molecule. A 
wide variety of reporter molecules; several of which are described above, are known in the art and 
may be used. 

30 A variety of protocols f(x measuring KAP, including HTJSAs, RIAs, and FACS, are known 

in the art and provide a basis for diagnosing altered or abnormal levels of KAP expression. Normal 
or standard values for KAP expression are established by combining body fluids or cell extracts 
taken fiom normal mammalian subjects, for example, human subjects, with antibodies to KAP 
under conditions suitable for complex formation. The amount of standard con^lex formation may 

35 be quantitated by various methods, such as photometric means. Quantities of KAP expressed in 
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subject, control, and disease sanqples from faiqpsied tissues axe conq>aied with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

hi anoth^ ^nbodiment of the invention, the polynucleotides encoding KAP may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 

5 complementary RNA and DNA molecules, and FNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of KAP may be conelated 
withdisease. The diagnostic assay may be used to deterinine absence, presence, and excess 
expression of KAP, and to monit(»: reguladm of KAP levels during therapeutic intervention. 

Ld one aspect, hybridization witti PGR probes which are capable of detecting polynucleotide 

10 sequences, including genomic sequences, oicoding KAP or closely related molecules may be used 
to identify nucleic add sequences which encode KAP. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the S' regulatory region, or from a less specific region, e.g., 
a conserved motif, and the string^cy of the hybridization or anqplification will determine whether 
the probe identifies only naturally occurring sequ«K:es encoding KAP, allelic variants, or related 

15 sequ0u:es. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the KAP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:21-40 or 
from genomic sequences including promoters, enhancers, and introns of the KAP gene. 

20 Means for producing specific hybridization probes for DNAs encoding KAP include the 

cloning of polynucleotide sequences encoding KAP or KAP derivatives into vectors for the 
production of mRNA probes. Such vectors are known in ttie art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 

25 variety of reporter groups, for example, by radionuclides such as ^ or ^S, or-by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding KAP may be used for the diagnosis of disorders 
associated with expression of KAP. Exaiqples of such disorders include, but are not limited to, a 
cardiovascular disorder such as artmovenous fistula, atherosclerosis, hypertension, vasculitis, 

30 Raynaud's disease, aneurysms, artmal dissections, varicose veins, thrombophlebitis and 
phlebothrombosis, vascular tmiKsrs, and complications of thrombolysis, balloon angpioplasty, 
vascular replacement, and corcmary artery bypass graft surgery, congestive heart failure, ischemic 
heart disease, angina pect(»is, myocardial infarction, hypertensive heart disease, degenerative 
valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral 

35 annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective 
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endocaiditis, nonbacterial tbrombotic endocarditis, endocaiditis of systemic lupus eiythematosus, 
carcinoid heart disease, cardiomyopatliy, myocarditis, pericarditis, neoplastic heart disease, 
congenital heart disease, and conq>lications of cardiac transplantation, congenital lung anomalies, 
atelectasis, pulmonary congestion and edema, puhnonary embolism, pulmonary hemorrhage, 
5 pulmonary infarction, pulmcHiary hypertension, vascular sclerosis, obstructive pulmonary disease, 
restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic 
bronchitis, bronchial asthma, bronchiectasis, bactmal pneumonia, viral and mycoplasmal 
pneuinonia, lung abscess, pulnionary tuberculosis, difhise interstitial dise^ 
sarcoidosis, idiopathic puhnonary jBhrosis, desquamative interstitial pneumonitis, hypersensitivity 

10 pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-cnrganizing pneumonia, difiiise 
pulmonary hemoofaage syndromes, Goodpasture's syndromes, idiopathic puhnonary hemosiderosis, 
puhnonary involvement in coUagen-vascular disorders, puhnonary alveolar proteinosis, lung tdmors, 
inflammat ory and n oninflamm atory pleural efiusions, pneumothorax, pleural tumors, drug-induced 
lung disease, radiation-induced lung disease, and conq>lications of lung transplantation; an immune 

15 disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anonia, asthma, adierosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimrrume polyendocrinopathy- 
candidiasis^todermal dystrophy (APECED), bronchitis, cholecystitis, contact demiatitis. Crohn's 
disease, atopic dermatitis, deraaatomyositis, diabetes mellitus, emphysema, episodic lynq)hopenia 

20 with lynq)hocytotoxins, erythroblastosis fistalis, erythema nodosum, atrophic gastritis, 
•glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndronte. rheumatoid arthritis, sclerodenna, SjDgren's syndrome, systemic anaphylaxis, systemic 

25 lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, conq)lications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and hehninthic infections, and trauma; a neurological disorder 
such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Huntington's disease, dran^tia, Parkinson's disease and other 

30 extrapyramidal disordea:s, amyotrophic lateral sclerosis and other motor neuron disorders, 

progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and 
other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, 
epidural abscess, suppurative mtracranial thrombophlebitis, myelitis and radiculitis, viral central 
nervous system disease, prion diseases mcluding kuru, Creutzfeldt-Jakob disease, and Gerstmann- 

35 Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
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nervous system, neuiofibioinatosis, tuberous sclerosis, cerebelloretmal heiiiangioblastoiiiatosis, 
encephalobigeminal syndrome, mental retardation and oth^ developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial n^e disorders, spinal cord diseases, muscular dystrophy and 
5 other neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and 
polymyositis, mherited, metabolic, endocrine, and toxic myopathies, myastb^ua gravis, periodic 
paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, seasonal affective 
disord^ (SAD), akathe^a, amnesia, catatonia, diabetic neuropafliy, tardive dyskmesia, dystonias, 
paranoid psychoses, postherpetic neuralgia, Tourette's disord^, progressive supranuclear palsy, 

10 corticobasal deg^eration, and familial firontot^Doporal dementia; a growth and developmental 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, drriiosis, hepatitis, 
mixed connective tissue disease OMLCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vm, psoriasis, primary flm>inbocythemia, renal tubular acidosis, anemia, Cushing*s 
syndrome, achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal 

15 dysgenesis, WAGR syndrome (Wilms' tmnor, aniridia, genitourinary abnormalities, and mmtal 
retardaticm), Smith-Magaais syndrome, myelodysplastic syndrome, heteditaiy mucoepithelial 
dysplasia, hereditary keratodermas, hereditary neuropathies such as C3iarcot-Marie-Tooth disease 
and neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's 
chorea and cerebral palsy, spina bifida, anencq>haly, craniorachischisis, congenital glaucoma, 

20 cataract, and sens(^eural hearing loss; a lipid disorder such as &tty liv^, cholestasis, primary 
biliary cirrtiods, camitme deficiency, camitnie palmitoyltransferase defidmcy, myoadenylate 
deaminase deficiency, hypertriglyceridemia, lipid storage disorders such Fabry's disease, Gaucher's 
disease, Niemann-Pick's disease, metachromatic leukodystrophy, adrenoleukodystrophy, GMj 
gangliosidosis, and ceroid lipofuscinosis, abetalq)oprotNnemia, Tangier disease, 

25 hyperiipq[iroteinemia, diabetes mellitus, Iqxxlystrcqthy , lipomatoses, acute panniculitis, 

disseminated fat necrosis, adiposis doloiosa, lipoid adrenal hyperplasia, mmimal change disease, 
lipomas, atteroscl^sis, hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, 
primary hypoatphalipoproteinemia, hypoAyrddism, raial disease, liv^ disease, lecithin:diolesterol 
acyltransfi^ase deficiency, cerebrotendinous xanthomato^, atosterolerrda, hypochdestmlemia, 

30 Tay-Sachs disease, Sandhofif s disease, hyperiipid^nia, hyperlipemia, lipid myopathies, and 
obesity; and a cell proliferarive disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, 
bursitis, ckrhosis, hepatitis, mixed connective tissue disease (M CTD), myelofibrosis, paroxysmal 
nocturoal hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancm 
includii^ adenocarcinoma, leukemia, lynq>homa, melanoma, myeloma, sarcoma, teratocarcinoma, 

35 and, in particular, cancers of the adroaial gland, bladder, bone, bone manow, brain, breast, cervix. 
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gall bladd^, ganglia, gastcointestmal tract, heart, kidney, liver, lung, muscle, ovaiy, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, uterus, leukemias 
such as multiple myeloma, and lymphomas such as Hodgkin's disease. The polynucleotide 
sequences encoding KAP may be used in Southern or northern analysis, dot blot, or other 
5 membrane-based technologies; in PGR technologies; in dipstick, pin, and multiformat EUSA-like 
assays; and m microarrays utilizing fluids or tissues from patients to detect altered KAP e;q>ression. 
Such qualitative or quantitative methods are well known in tfie ait 

In a particular aspect, the nucleotide sequences encoding KAP may be useful m assays that 
detect the presence of assodated disorders, particukoiy those mentioned above. The nucleotide 

10 sequences encoding KAP may be labeled by standard metiiods and added to a fluid or tissue san^le 
from a patient under craditions suitable for the formation of hybriffization conq)lexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and conqpaied with a 
standard value. Ifthe amount ofsignd in the patient san^le is significantiyalteml in conq>ari 
to a control sanf>le then the presence of altered levels of nucleotide sequences encoding KAP in the 

15 sample in d icates the presence of the associated disorder. Such assays may also be used to evaluate 
the efBcacy of a particular tiierapeutic treatment legimra in animal studies, in clinical trials, or to 
monitor the treattnrait of an individual patirat 

In ord^ to provide a basis for the diagnosis of a disorder associated with expression of 
KAP, a normal or standard profile for expression is established. This may be accomplished by 

20 combining body fluids or cell extracts taken from normal subjects, eidier animal or human, with a 
sequence, or a fragment thereof, encoding KAP, und^ conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by conq>aring the values obtained from 
normal subjects with values fiom an e^q>eriment in which a known amomt of a substantially 
purified polynucleotide is used. Standard values obtained in this mann^ may be con^ared with 

25 values obtained fixmisaiiq}les fiom patients who are syniq>tomatic for a disorder Deviationfrom 
standard values is used to establish the presence of a disorder. 

Once the presence of a discnder is established and a treatm^t protocol is initiated^ 
hybridization assays may be rq)eated on a regular basis to determine if the level of expression in the 
patient begins to ^)im>ximate that which is observed in the normal subject The results obtained 

30 fi^m successive assays may be used to show the efBcacy of treatmottt over a period ranging from 
several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biq)sied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the 

35 appearance of actual clinical synq>toms. A more definitive diagnosis of this type may allow health 
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professionals to eniploy preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides dedgned from the sequences encoding KAP 
may involve the use of PGR. These oligomers may be chemically synthesized, generated 
5 enzymatically, or produced in vitro . Oligomers will preferably contain a fiagment of a 
polynucleotide encoding KAP, or a fragment of a polynucleotide conq)lementaiy to the 
polynucleotide encoding KAP, and will be aq>loyed under optimized conditions for identification 
of a specific gene or condition. Oligomers may also be enq)loyed under less stringent conditions for 
detection or quantification of closdy related DNA or RNA sequimces. 

10 In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 

encoding KAP may be used to detect single nucleotide polymorphisms (SNPs), SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
confonnation polymorphism (SSCP) and fluorescent SSCP(£SSCP) methods. InSSCP, 

15 oligcmucleotide primers derived from the polynucleotide sequences encoding KAP are used to 
anq)lify DNA using the polymnase chain reaction (PGR). The DNA may be derived, for exanqile, 
fromdiseasedornormaltissue^bippsysan^les, bodily fluids, and the like. SNPs m the DNA cause 
difTerences in the secondary and tertiary structures of PGR products in smgle-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 

20 oligonucleotide primers are fluorescently labeled, which allows detection of &e amplimers in hi^ 
throughput equipment such as DNA sequencing machines. Additionally, sequence database 
analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by 
conq>armg the sequence of individual overlapping DNA fragments which assemble into a conmaon 
consensus sequence. These conq)uter-based methods filter out sequence variations due to 

25 laboratory preparation of DNA and sequencing eirors using statistical models and automated 
analyses of DNA sequence chromatograms. hi the alternative, SNPs may be detected and 
characterized by mass spectrometry usmg, for example, the high througlq)ut MASSARRA Y system 
(Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of KAP include radiolabeling 

30 or biotinylating nucleotides, coanq>lification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C 
et al. (1993) Anal. Biochan. 212:229-236.) The speed of quantitation of multiple sanq)les may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide 
of interest is presrated in various dilutions and a spectrophotometric or coloiin^tric response gives 

35 rapid quantitation. 
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In forfter einbodiioents, oligonucleotides or long^ firagments derived fironi any of the 
polynucleotide sequraces described hmin may be used as elements on a microarray. The 
nodcroarray can be used in transcript imaging techniques which monit<»: the relative expression 
levels of large numbers of genes simultaneously as described below. The microarray may also be 
5 used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progressionAegiession of disease as a function of gene e}q)ression, and to develop and 
monitortiieactivitiesof ther^uticagmts in tiietreatmcait of disease. ]b particular, tiiis 
information may be used to develop a pharmacogenomic profile of a patient in order to select the 

10 most appropriate and efGoctive treatment regimen for tiiat patient For example, therapeutic agents 
which are highly effective and display the fewest side effects may be selected for a patient based on 
his/her pharmacQgenomic profile. 

In another embodiment, KAP, firagments of KAP, or antibodies specific for KAP may be 
used as elements on a microarray. The rnicroaiiay ntmy be used to monitor or rneasure protein- 

IS protein interactiws, drug-4arget interactions, and g^ expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern 
of gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed 
by quantifying the number of expressed genes and their relative abundance und^ given conditions 

20 and at a given time. (See Seilhamer et al., **Conq>arative Gene Transcript Analysis,** U.S. Patent 
No. 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be 
generated by hybridizing the polynucleotides of the present invention or their coixq)lements to the 
totality of transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, 
the hybridization takes place in high-throughput format, wherein the polynucleotides of the present 

25 inventicm or their con^lem^ts conqmse a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, 
biopsies, or oth^ biological sauries. The transcript image may thus reflect g^e expression m 
vivo> as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

30 Transcript images which profile the expression of the polynucleotides of the present 

invration may also be used in conjuncticm with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturaUy-occurring environmental 
conq)ounds. All conQX>unds induce characteristic gene expression patterns, fiequendy termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 

35 toxicity (Nuwaysir, EJF. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
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(2000) Toxicol. Lett 112-113:467-471, expressly incorporated by reference herein). If a test 
conqx>und has a signature similar to that of a compound with known toxicity, it is likely to share 
those toxic properties. These fingeipiints or signatures are most useful and refined when they 
contain expression infonnation from a largp number of genes and gene families. Ideally, a g^om&- 
5 wide measurement of e^qxression provides the highest quality signature. Even genes whose 
expression is not alt^ed by any tested conq>ounds are important as well, as the levels of expression 
of these genes are used to nonnalize the rest of the e^qniession data. The nonnalization procedure is 
useful for cQnq)arison of e^qxression data after treatment with difE^ Whilethe 
assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 

10 mechanisms, knowledge of g^ie function is not necessary for the statistical matriimg of signatures 
which leads to prediction of toxicity. (See, far exanqxle. Press Release 00^)2 fiom the National 
histitute of Environn^ntal Health Sciences, released February 29, 2000, available at 
ht^://www,niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is inqxntant and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

15 In one embodiment, the toxicity of a test conq>ound is assessed by treating a biological 

sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sanq>le are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels coiresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are conqmred 

20 with levels in an untreated biological sanq)le. Differences in the transcript levels between the two 
samples arc indicative of a toxic response caused by the test conq>ound in the treated sanq)le. 

Another particular embodiment relates to the use of the polypeptide sequences of the 
present invention to analyze the proteome of a tissue or cell type. The term pn>te(Hne refers to the 
global pattern of protein ejqjression in a particular tissue cell ^pe. Each protein conqKxnent of a 

25 proteome can be subjected individually to further analysis. Proteome expression patterns, or 
profiles, are analyzed by quantifying the number of eTqnressed proteins and their relative abundance 
under given conditions and at a given time. A profile of a cell's pioteon» may thus be generated by 
separating and analyzing the polypeirtides of a particular tissue or cell type. In one embodin^nt, the 
separation is achieved using two-dimensional gel electrophoresis, m which protenis ftom a sanq)le 

30 arc separated by isoelectric focusing in the first dimension, and then according to molecular weight 
by sodium dodecyl sulfate slab ^1 electrophwesis in the second dimension (Steiner and Anderson, 
supra). The proteins are visualized in the gel as discrete and uniquely positioned spots^ 
staining the gel with an agent such as Coomassie Blue or sflver or fluorescent stains. The optical 
density of each protein spot is generally proportional to the level of the protein in the sample. The 

35 optical densities of equivalently positioned protein spots ficom differcnt samples, for example, from 
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Inological samples dtfaer treated or untreated with a test compound or Oierapeutic agent, are 
coiiq>aredtoidentify any changes iQ protein spot density related to tt^ The proteins in 

the spots are partially sequraced using, for exanq)le, standard methods enq)loying chemical or 
enzymatic cleavage foUowed by mass spectrometry. The identity of the protein in a spot may be 
5 determined by comparing its partial sequence, preferably of at least 5 contiguous amino acid 
residues, to the polypeptide sequences of the present invration. Jn some cases, further sequence 
data may be obtained for definitive protein identification. 

A proteomic profile may also be go&erated using antibodies specific for ElAP to quantify the 
levels of KAP expression. In one embodiment, die antibodies are used as elements on a microanay, 

10 and protein eTcpression levels are quantified by exposing the microarray to the sanoqple and detecting 
the levels of protein bound to each array element (Luekmg, A. et al. (1999) Anal. Biochem. 
270:103-111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be perftMcmed 
by a variety of methods known in the art, for exanqple, by reacting tte proteins in the sair^le with a 
thiol- c»: amino-readive fluorescent conq>ound and detecting the amount of fluorescence bound at 

15 each array element 

Toxicant signatures at the proteome level are also useful far toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderscm, 
N and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 
20 useful in the analysis of conq>ounds which do not significantly affect the transcript image, but 
which alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, 
due to rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in 
such cases. 

In another embodiment, the toxicity of a test conqmund is assessed by treating a biological 
25 sample containing proteins with the test coi£q;K>und. Proteins that are expressed in the treated 

biological san^le are separated so that the amount of each protein can be quantified. The amount of 
each protein is conq)ared to the amount of the coneqxmding protein in an untreated biological 
san^le. A difference in die amount of protein between the two sainples is indicative of atoxic 
response to die test coii^>ound in the treated san^le. hidividual proteins ate identified by 
30 sequencing the amino acid residues of the individual proteins and comparing these partial sequences 
to the polypq>tides of the preset invention. 

In anoth^ embodiment, tiie toxicity of a test conqx)und is assessed by treating a biological 
san[Q>le containing proteins with the test confound. Proteins from the biological san^le are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
35 protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
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san^le is coixq>aied with the aiDOunt in an imtreated biolo^ A difference in the amount 

of protein between the two sauries is indicative of a toxic response to the test confound in the 
treated sanq>le. 

Nficroarrays may be prepared, used, and analyzed using methods Im (See, 
5 e.g.. Brennan, T.M. et aL (1995) US. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. NatL 
Acad. Sci. USA 93:10614-10619; Baldeschweiler et al. (1995) PCT appUcation W095/251116; 
Shalon, D. et al. (1995) PCT application WO95/35505; HeDer, R.A. et al. (1997) Proc. Natl. Acad. 
Sci. USA 94:2150-2155; and Heller, MJ. et al. (1997) U.S. Patent No. 5,605,662.) Various types of 
micfoarrays are well known and thcmmgbly described in DNAMicroanravs: A Practical Approack 
10 M. Schena, ed. (1999) Oxford University Press, London, h^by e^ressly incorporated by 
reference. 

hi another embodiment of the invaition, nucleic acid sequences encoding KAP may be used 
to generate hyhndizatiaa probes useful in mapping the naturally occurring genomic sequence. 
Either coding or ncmcoding sequences may be used, and in some instances, noncoding sequences 

15 may be preferable over coding sequences. For example, cons^ation ofa coding sequence among 
members of a multi-gene family may potentially cause undesired cross' hybridization during 
chromosomal msqsping. Thesequencesiiiaybeinappedtoai>articularchroinosome,toasped^ 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 

20 (BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, 
J.J. et al. (1997) Nat Genet 15:345-355; Price. CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. 
(1991) Trends Genet 7: 149-154.) Once mapped, the nucleic acid sequoices of the invention may 
be used to develop genetic linkage maps, for exanq>le, which correlate the inheritance of a disease 
state with the inheritance of a.particular chromosome region or restriction firagment length 

25 polymorphism (RFLP). (See, for example, Land^, E.S. and D. Botstein (1986) Proc. Natl, Acad. 
Sd. USA 83:7353-7357.) 

Fluorescent in situ hybridization (PISH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et aL (1995) in Meyers, supra, pp. 965-968.) Exan^les of 
genetic map data can be found in various scientific journals or at the Online Mendelian inheritance 

30 in Man (OMM) World Wide Web site. Correlation between the location of the gpne encoding 
KAP on a physical map and a specific disorder, or a predisposition to a specific diswier, may hdp 
define the region of DNA associated with that disorder and thus may further positional cloning 
efforts. 

hi situ hybridization of chromosomal preparations and physical mapping techniques, such as 
35 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
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Often the placement of a g^e on the chmmosome of another maminaB an species* such as mouse, 
may leveal associated maiicers even if the exact chromosomal locus is not known. This information 
is valuable to investigators searching for disease genes using positional cloning or other gene 
discovery techniques. Once the gene or genes responsible for a disease or syndrome have been 

5 crudely localized by genetic linkage to a particular genomic xegion, e.g., ataxia-telangiectasia to 
1 lq22-23, any sequraces mapping to that area may re^nesent associated or regulatory genes for 
further investigation. (See, e.g., Gatti, RA. et aL (1988) Nature 336:577-580.) The nucleotide 
sequence of the mstant invention may also be used to detect differences in ttse chromosomal 
location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

10 hi another embodiment of the invention, SlAP, its catalytic or immunogenic IGragments, or 

oligopeptides thereof can be used for screening libraries of conq>ounds in any of a variety of drug 
screemng techniques. The fragment en^loyed in such screening may be fiiee in solution, afGxed to 
a solid support, borne on a ceU surfoce, or located intracelhilarly. The formation of binding 
con^lexes between KAP and the agrat bemg tested may be measured. 

15 Another technique for drug screening provides for high throughput soeming of conopounds 

havmg suitable binding affinity to the protein of interest (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) Li this method, large numbm of different small test compounds are 
synthesized on a solid substrate. The test conq)ounds are reacted with KAP, or fragments thereof, 
and washed. Bound KAP is then detected by methods weD known in the art Purified KAP can also 

20 be coated direcUy onto plates for use in the aforementicmed drug screening techniques. 

Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
soUd support 

In another embodiment, one may use conf^titive drug screening assays in which 
neutralizing antibodies capable of binding KAP specifically conq)ete with a test conqwund for 
25 bindingKAP. hi this manner, antibodies can be used to detect the presence of any peptide which 
shares one or more antigenic determinants with KAP. 

In additional embodiments, the nucleotide sequences which encode KAP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely 
on prop^es of nucleotide sequences that are currently known, including, but not limited to, such 
30 properties as the triplet gos^c code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, usiag the preceding 
description, utilize the present invention to its fullest extent The following embodiments are, 
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

35 The disclosures of all patents, applications and publications, mentioned above and below. 
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including U.S. Ser. No. 60/254,034. U.S. Ser. No. 60/255.756, U.S. Ser. No. 60/251.814. U.S. Ser. 
No. 60/256.172. U.S. Scsr. No. 60/257,416. U.S. Ser. No. 60/260,912, U.S. Ser. No. 60/264.344. and 
U.S. Ser. No.60/266,017. are expressly incorporaled by reference herein. 

5 EXAMPLES 
L Construction of cDNA Libraries 

L[icyte cDNAs derived from cDNA libraries described in the LIFESEQ GOLD 
database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in 
guanidinium isothiocyanate, while oth^ were homogenized and lysed in phenol or in a suitable 

10 mixtuie of denaturants. such as TRIZOL QLif e Technologies), a monqphasic solution of phenol and 
guanidine isothiocyanate. The resulting lysates centrifuged over CsQ cushions or extracted 
with chlorof omL RNA was precipitated from the lysates with either isopropanol or sodium acetate 
and ethanol, or by otiier routine methods. 

Phenol extraction and precipitation of RNA wm repeated as necessary to increase RNA 

15 purity. In some cases, RNA was treated witib DNase. For most libraries, poly(A>f RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OUGOTEX latex particles 
(QIAGEN, Chatsworth CA), or an OUGOTEX mRNA purification kit (QIACffiN). Alternatively, * 
RNA was isolated direcfly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

20 In some cases, Stratagene was provided with RNA and constructed the corresponding 

cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), usmg 
the recommended procedures or similar methods known in the art (See, e.g., Ausubel, 1997, supra. 
units 5.1-6.6.) Reverse transcription was initiated using oligo dCT) or random primes. Synthetic 

25 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with 
the appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected 
(300-1000 bp) using SEPHACRYL SIOOO, SEPHAROSE CL2B, or SEPHAROSE CLAB column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. 
cDNAs were lighted into conq)atible restriction ^izyme sites of the polylinker of a suitable plasmid, 

30 e.g., PBLUESCRffT plasmid (Stratagene), PSPORTl plasmid (Life Technologies), PCDNA2.1 
plasmid (lavitro^ Carlsbad CA), PBK-CMV plasmid (Stratagene), PCR2-TOPOTA plasmid 
(Ittvitrogen), PCMV-IOS plasmid (Stratagene), plGEH (hicyte Genomics, Palo Alto CA). pRARE 
(Incyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. Recombinant plasmids 
were transfomaed into con5)etent Rcoli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 

35 Stratagpne or DHSa, DHIOB, or ElectroMAX DHIOB from life Technologies. 
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n. bdationofcDNAaones 

Plasmids obtained as described in Exan]ple I were recovered fiom host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cefl lysis, Plasmids were purified 
using at least one of the following: a Magic or WIZARD Nfinipieps DNA purificaticm system 
S (Promega); an AGTC Mmipiep purification kit (Edge Biosystems. Gaithersburg MD); and 
QIAWELL 8 Plasmid, QIAWELL 8 Phis Plasmid. QIAWELL 8 Ultra Plasmid purification systems 
or the RJB. AL. PREP 96 plasmid purification kit from QIAGEN. Following precipitatian. plasmids 
wefe resttspended in 0.1 ml of distilled water and stored, with or without lyophilization, at 4''C. 

Alternatively, plasmid DNA was ampUfied fiom host cell lysates using direct link PCR in a 

10 high-throughput fcwmat (Rao, VM, (1994) Anal. BiochraL 216: H4). Host cell lysis and thramal 
cycling steps were carried out in a single reaction mixture. Saiiq>les were processed and stcned ia 
384-well plates, and the concratraticm of anq>lified plasmid DNA was quantified fluorometrically 
using PICXXaOEEN dye (Molecular Probes, Bugraie OR) and a FLUOROSKAN n fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 

15 HL Sequencing and Analysis 

hicyte cDNA recovered m plasmids as described in Bxsassple U were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) m conjunction with the HYDRA microdispenser (Robbins Scientific) or the 

20 MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prq>aied 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such 
as the ABI PRISM BIGDYE TOTninator cycle sequoicing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACaS 1000 DNA sequMicmg system (Molecular Dynamics); the 

25 ABI PRISM 373 or 377 sequencing system (AppHed Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequ^ce analysis systems known in the art Reading 
firames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra, unit 7.7). Some of the cDNA sequences wrae selected for extension using the 
techniques disclosed in Exaiz^>le VUL 

30 The polynucleotide sequences doived fiom Incyte cDNAs were validated by removing 

vector, linker, and poly(A) sequaices and by masking ambiguous bases, usmg algorithms and 
programs based on BLAST, dynamic programmmg, and dinucleotide nearest neighbor analysis. 
The Incyte cDNA sequences or translaticms thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, 

35 and BLOCKS. PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo 
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sapiens. Rattus norvegicus, Mns iniiRfinlng- Caenotfaabditis elegans, Saccharomvces CCTevisiae, 
fehiznoflrrha romvces pombe> and r«nii^Ha ftlhirjina (Incyte Genomics, Palo Alto CA); and hidden 
Markov model (HMM)-based protein family databases such as PFAM. (HMM is a probabilistic 
approach which analyzes consensus primary structures of gene families. See, for example, Eddy, 
5 S JL (1996) Curr. Opin. Struct Biol. 6:361-365.) The queries were peifonned using programs 
based on BLAST, PASTA, BLIMPS, and HMMER. The hicyte cDNA sequences were assembled 
to produce full length polynucleotide sequeaices. Alternatively, GenBank cDNAs, GenBank ESTs, 
stitched sequences, stretched sequences, or Grascan-predicted coding sequences (see Examples IV 
and V)wm used to extend hicytecDNAasseniblages to fiiU length. Assenibly was performed 

1 0 using programs based on Phred, Phrap, and Consed, and cDNA assemblages were screened for open 
reading franies using programs based on (jeneMark, BLAST, an^ The full length 

polynucleotide sequences were translated to derive the corresponding full Iragth polypeptide 
sequences. Alternatively, a polyp^tide of the inv^tion may begin at any of the methionine 
residues of the fidl length translated polypq^de. Fiill length polypq>tide sequences were 

15 subsequently analyzed by querying against databases such as the GenBank protein databases 
(goapept), Swis&Prot, tte PROTEOME databases, BLOCKS, KUNTS, DOMO, PRODOM. Rrosite, 
and hidden Markov model (HMM)-based protein 6anily databases such a^ Full length 

polynucleotide sequences are also analyzed usmg MACDNASIS PRO software (Hitachi Software 
l^gmeermg. South San Htancisco CA) and LASERGENE sofbKrare (DNASTAR). Polynucleotide 

20 and polypeptide sequence alignments are generated usmg default parameters specified by the 
CLUSTAL algorithm as incoiporated into the MEGALIGN multisequence alignment program 
0)NASTAR), which also calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and ass^nbly 
of Incyte cDNA and full l^gth sequences and provides applicable descriptions, references, and 

25 threshold parameters. The first colunm of Table 7 shows the tools, pn>graiiis, and algorithms used, 
the seccmd column provides brief descriptions thereof, the third column presets appropriate 
references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 

30 the greater the identity between two sequences). 

Tht programs described above for the assembly and analysis of full length polynucleotide 
and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:21>40. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and an^lification technologies are described in Table 4, column 2. 

35 IV. Identification and Editing of Coding Sequences from Genomic DNA 
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Putative kmases and phosphatases vfm initiany idoatified by nmning the Genscan gene 
identification program against public genomic sequence databases (e.g., gbpii and gbhtg). Genscan 
is a general-purpose g^e identification program which analyzes gnomic DNA sequences from a 
variety of organisms (See Burge, C and S. Karlin (1997) J. MoL Biol. 268:78-94. and Burge. C. and 
5 S. Karlin (1998) Curr. OpiiL Struct Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled dDNA sequence extrading from a methicxiine to a stop codon. The output of 
Genscan is a PASTA database of polynucleotide and polypeptide sequences. The maxinriinn range 
of sequence for Genscan to analyze at once was set to 30 ld>. To detmnme which of these Genscan 
predicted cDNA sequences encode kuoAses and phosphatases, the encoded polypeptides were 

10 analyzed by querying against PPAMinodels for kmases arid phosphatases. Potential kinases and 
phosphatases were also identified by homology to hicyte cDNA sequ^ces that had been aimotaled 
as kinases and phosphatases. These selected Genscan-predicted sequences were then compared by 
BLAST analysis to the genpept and gbpri public databases. Where necessary, the Genscan- 
predicted sequences were then edited by comparis(xi to the top BLAST hit from genpept to correct 

15 errors in the sequence predicted by Genscan, such as extra or oimtted exons. BLAST analysis was 
also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted sequmces, 
thus providing evidence far transcription. When Incyte cDNA coverage was available, this 
information was used to correct or confirm the Genscan predicted sequence. Full length 
polynucleotide sequences were obtained by assembling Genscan-predicted codmg sequences with 

20 Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in 
Exanq)le EL Altmiatively , full l&ngth polynucleotide sequences were derived entirely fix>m edited 
or unedited G^iscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

25 Partial cDNA sequences were extended with exons predicted by the Genscan gene 

identification program described in Exanq>le IV. Partial cDNAs assembled as described in Exan^le 
in were mapped to gonomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an 
algorithm based on graph theory and dynamic programming to integrate cDNA and genomic 

30 inf cMmaticm, generating possible splice variants that were subsequently confirmed, edited, or 
extended to create a fiill length sequ^ce. Sequence intervals in which the entire length of the 
interval was present on more than one sequence in the clust^ were identified, and intervals thus 
identified were considered to be equival^t by transitivity. For exan^le, if an interval was present 
on a cDNA and two genomic sequences, then all three intervals were considered to be equivalent 

35 This process allows unrelated but consecutive genomic sequences to be brought together, bridged by 
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cDNA sequence. luteals thus identified wexe tbsxi "sdtched" togeth^ by the stitching algorithm 
in the aider that they appear along their parent sequences to generate the longest possible sequence, 
as well as sequrace variants. linkages between intervals which proceed along one type of parent 
sequence (cDNA to cDNA or genomic sequence to genonoic sequence) were given pref^race over 
5 linkages which change parent type (cDNA to genomic sequmce). The resultant stitched sequences 
were translated and conq)ared by BLAST analysis to the genpept and gbpri public databases. 
Incarrect exons predicted by Genscan were corrected by comparison to the top BLAST hit from 
genpqpt Sequ^ces were further extended with additicmdcDNA sequences,^ 
gnomic DNA, when necessary. 

10 **StrctchcJ* Scqacnccs 

Partial DNA sequraces were extended to full length with an algorithm based on BLAST 
analysis. Rrst, partid cDNAs assembled as described in Exanq>le in were queried aga^ 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GraBank protein homolog was then CQnq>ared by BLAST 

15 analysis to either Iticyte cDNA sequences or GenScan &ioa predicted sequences described in 
Ewxxplc IV. A chimeric protein was gmerated by using the resultant hi^-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for 

20 homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. TTie 
resultant stretched sequences were examined to determine whether it contained a conq)lete gene. 
VI. Chrmnosomal Mappii^ of KAP Encoding Poiynndeotides 

The sequences which were used to assemble SEQ ID NO:21-40 were compared with 

25 sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implernentations of the Smith-Waterman algorithin. 5>eqiiencftR frnni thfty* Hatabas^ th^^t matcht^ 
SEQ ID NO:21-40 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phra^ (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Gemome Center (SHGQ, Whitehead Institute for 

30 Genome Research (WIGR), and Genethon were used to det^mine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a clust^ resulted in the 
assignment of all sequences of that cluster, mcluding its particular SEQ ID NO:, to that map 
location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
35 position of an interval, in centiMoigans, is measured relative to the terminus of the chromosome's 
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p^aroDL (The centiMoigan (cM) is a unit of measuienient based on lecomhinalion frequencies 
between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of 
DNA in humans, although this can vary widely due to hot and cold spots of lecombination.) The 
cM distances are based on genetic mariners mapped by G6n6tfaon which provide boundaries for 
5 radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI '*G^eM£^'99" World Wide 
Web site (http://www.ncbi.nlm.nih.gov/genemap/)* can be en^loyed to determine if previously 
idratified disease genes map within or in proximity to the intervals indicated above. 

In tibis msaaner^ SEQ ID NO:33 was mapped to chromosome 12 withm the interval fiom 

10 97.10 to 113.30 centiMorgans. SEQ ID NO:3S was m2q[>ped to chromosome 3 withm the interval 
from 16.50 to 30.40 centiMorgans. SEQ ID NO:29 was mapped to chromosome 13 within the 
interval from 1 1.60 to 22.80 centiMorgans, to chromosome IS within the mterval from 7230 to 
77.30 c^tiMoigans, and to chromosome 20 within the interval fiom 57.70 to 64.10 centiMorgans. 
More than one map location is reported f<x SEQ ID NO:29, indicating that sequences having 

15 different map locations were assembled into a singje cluster. This situation occurs, for exan^le, 
whm sequences having strong similarity, but not CQaq>lete identity, are assembled into a smgle 
cluster. 

VQ. Analysbs of Polynucleotide Eqiressicm 

Northem analysis is a laboratory technique used to detect the presence of a transcript of a 
20 gene and invblves the hybridization of a labeled nucleotide sequence to a membrane on which 

RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supnL ch. 7; 

Ausubel (1995) supra, ch. 4 and 16.) 

. Analogous computer techniques applying BLAST were used to search for idaitical or 

related molecules in cDNA databases such as GenBank or LIFESEQ (Ihcyte Genomics). This 
25 analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of 

the conq>uter search can be modified to determine whether any particular match is categorized as 

exact or similar. The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
30 5 xminimum {length(Seq. 1), lraigth(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
35 productisdividedby (5 times the length of the shorter of the two sequences). The BLAST score is 
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calculated by assignisg a score of +5 for every base that matches in a high-scodng segment pair 
(HSPXand-4farevery msmatch. Two sequences inay shaie more than one HSP (separate 
g^s). If theie is mote than one HSP, then the pair with the highest BLAST score is used to 
calculate the product score. The product score represents a balance between fractional overlap and 

5 quality in a BLAST alignment Forexanq)le,a product score of 100 is produced only for 100% 
identity over the entire length of the shorter of the two sequences being conq^ared. A product score 
of 70 is produced either by 100% identity and 70% overlap at one end, or by 88% identity and 100% 
overly at the other. A product score of 50 is produced either by 100% identity and 50% overlap at 
one end, or 79% identity and 100% ov^Iap. 

10 Alternatively, polynucleotide sequences encoding KAP are analyzed with respect to the 

tissue sources from which diey were derived. For example, some fiill length sequences are 
assembled, at least in part, with overlapping Jjicytt cDNA sequences (see Exanq>le m). Each cDNA 
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective 

15 tissue; digestive system; enihryonic structures; endocrine syst^n; exocrine glands; genitalia, female; 
g^talia, male; gmn cells; hemic and immune system; liver; musculoskeletal system; nervous 
system; pancreas; respiratory system; sense oigans; skin; stomatognathic system; 
unclassified/mixed; or urinary tract The number of libraries in each category is counted and 
divided by the total number of libraries across all categories. Similarly, each human tissue is 

20 classified iuto one of the following disease/condition categories: cancer, cell line, developmental, 
inflammation, neurological, trauma, cardiovascular, pooled, and other, and the number of libraries 
in each category is counted and divided by the total number of libraries across all categories. The 
resulting percentages reflect the tissue- and disease-specific expression of cDNA encoding KAP. 
cDNA sequences and cDNA library/tissue information are found in the LIFESBQ GOLD database 

25 (Ihcyte Genomics, Palo Alto CA). 

VIH* £xtc»i^(»i of KAP Encoding Ptdynudeotides 

Full Iragth polynucleotide sequences were also produced by extension of an appropriate 
fragment of the lull l^igth molecule using oligonucleotide primers designed from this fragment 
One primer was synthesized to initiate S' extension of the known firagm^t, and the other primer was 

30 synthesized to initiate 3' extension of the known fragment The initial primers were designed using 
OLIGO 4.06 software (National Biosci^ces), or another appropriate program, to be about 22 to 30 
nucleotides in l^gth, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68**C to about 72**C. Any stretch of nucleotides which would 
result in hairpin structures and primar-primer dimerizations was avoided. 

35 Selected human cDNA libraries were used to extend the sequence. If more than one 
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extension was necessaiy or desired, additional or nested sets of primers were designed. 

He^MeMtyaiiqiMcation was obtained by PGR using methods weUk^^ PGR 
was p^onned in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The 
reaction mix contained DNA tenoplate, 200 nmol of each primi^, reaction buffi^ containing Mg^, 

5 QfH^O^y and 2-nu5rcaptoethanol, Taq DNA polymerase (Amrarsham Pharmacia Biotech), 
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Strata^e), with tiie 
foUowing parameters for primer pair PCI A and Pd B: Step 1: 94 ''C, 3 min; Step 2: 94''C, 15 sec; 
Step 3: eO'^C, 1 min; Step 4: 68**C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: eS^'Q 
Smin;Step7: storage at 4''C. In the alternative, the parameters for primsr pair T7 and SK+ were as 

10 follows: Step 1: 94**C 3 min; Step 2: 94''C, 15 sec; Step 3: 57**C, 1 mm; Step 4: 68**C, 2 min; Step 
5: Steps 2, 3, and 4 repeated 20 times; Step 6: eS'^C, 5 min; Step 7: storage at 4''C 

The conc^itration of DNA in each well was detmnined by dispensing 100 fil PIOOGREEN 
quantitation reag^t (0.25% (v/v) PKXKjREEN; Molecular Probes, Eug^ OR) dissolved in IX TE 
and 0.5 ftl of undiluted PGR product into each well of an opaque fluorimet^ plate (Coming Costar, 

IS Actm MA), allowing the DNA to bind to the reagent TheplatewasscaimedinaFluoroskann 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sanq^le and to quantify the 
concentration of DNA. A 5 /il to 10 /il aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

20 The extended nucleotides were desalted and concentrated, transferred to 3S4-well plates, 

digested with CvUI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 

25 clcHies were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 
(Amersham Iliarmacia Biotech), treated with Pfu DNA polyn^rase (Stratagene) to fill-in restriction 
site overtiangs, and transfected into conq)etent E. coli cells. Transformed cells were selected on 
antibicdc-<:ontaining media, and individual colonies were picked and cultured ov^night at ST^'C in 
3 S^-well plates in LB/2x carb liquid media. 

30 The cells were ly sed, and DNA was arn)lified by PGR using Taq DNA polymerase 

(Ami^ham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94**C, 3 min; Step 2: 94**C, 15 sec; Step 3: 60**C 1 min; Step 4: 72X, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 tin^s; Step 6: 72*'C, 5 mm; Step 7: storage at 4**C DNA was 
quantified by PICOGREEN reag^t (Molecular Probes) as described above. Saiiq)les with low 

35 DNA recoveries were reamplified using the same conditions as described above. Sanqiles were 
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diluted with 20% dimethysiilfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Tenninator cycle sequencing ready reaction kit (Applied Biosystems). 

In like mann^, fiill length polynucleotide sequences are verified using the above procedure 
5 or are used to obtain S' regulatory sequences using the above procedure along with oligonucleotides 
designed for such extension, and an ^propriate genomic library. 
IX. Labding and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:21-40 are employed to screen cDNAs, 
gnomic DNAs, or mRNAs. Although the labeling of oligonucleotides* consisting of about 20 base 

10 pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fiagmi^ts. Oligonucleotides are designed using state-of4he^ software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining SO pmol of each oligomer, 250 fzG of 
[y-^ adenosine triphosphate (Am^sham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

15 SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10^ counts per minute of the labeled probe is used in a typical membrane- 
based hybridization analysis of human genomic DNA digested with one of the following 
cndonucleases : Ase I, Bgl H, Eco RI, Pst I. Xba I, or Pvu n (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

20 membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40''C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for exanq)le, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alt^native imaging means and 
compared. 

25 X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing. See, e.g., Baldeschweiler, supra ,), 
noechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
aforem^tioned technologies should be rmiform and solid with a non-porous surface (Schena 

30 (1999), supra). Sug^sted substrates include silicon, silica, glass slides, glass chips, and silicon 
wafers. Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and 
link el^nents to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical anay may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elem^ts. (See, e.g., 

35 Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) C3enome Res. 6:639-645; 
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Marshall A. and L Hodgson (1998) Nat Biotecfanol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (BSTs\ or fragments or oligomers thereof 
may comprise the elements of the microarray . Fragments or oligomers suitable for hybridization 
can be selected using software well known in the art such as LASERGENE software (DNASTAR). 

5 Hie array elements are hybridized with polynucleoddes in a biological sample. The polynucleotides 
in the biological sample are conjugated to a fluorescent label or other molecular tag for ease of 
detection. After hybridization, nonhybridized nucleotides from the biological sanq>le are removed, 
and a fhi(»escence scanner is used to detect hybridization at each anayel^Doent Alternatively, laser 
desoibtion and mass spectrometry may be used for d^ection of hybridization. The degree of 

10 conq>lementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed In one embodim»it, microarray preparation and usage is 
described in detail below. 
Tissue o r CM ^Samplft ly^ft Tation 

Total RNA is isolated &om tissue sanoples using die guanidinium thiocy anate method and 

15 poly(A)^ RNA is purified using the oligo-(dT) cellulose method. Each polyCA)"** RNA sample is 
reverse transcribed usmg MMLV reverse-transcriptase» 0.05 pg//d oligo-(dr) primer (21xDBr)^ IX 
first strand buffer, 0.03 units/^ RNase inhibitor, 500 ijM dATP, 500 fiM dGTP, 500 lOA dTTP, 40 
iM dCTP, 40 ijM dCrP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is perfimned in a 25 ml volunoe containmg 200 ng pbly (A) * RNA with 

20 GEMBRIGHT kits (bicyte). Specific control po]y(A)* RNAs are synthesized by in vitro 

transcription from non-coding yeast graiomic DNA. After incubation at 37°C for 2 far, each reaction 
sanq>le (one witti Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium 
hydroxide and incubated for 20 mmutes at 85 ° C to tiie stqp die reaction and d^rade the RNA. 
Sanq)les are purified usuig two successive CHROMA SPIN 30 gel filtration spin columns 

25 (CIX>NTECHLaboratc»ies, Inc. (CLONTECB^ Palo Alto CA) and after combining, botii reaction 
samples are etfaanol precipitated using 1 ml of glycogen (1 mgAnl), 60 ml sodium acetate, and 300 
ml of 100% ethanol. The sample is then dried to completion using a SpeedVAC (Savant 
Instruments Inc., Holhrook NY) and resuspended m 14 5X SSC/0.2% SDS. 
Micraarrav Preparatimi 

30 Sequences of the present invention are used to geoimte array elements. Each array element 

is anq>Medfiom bacterid cells containing vectora with clewed cDNA inserts. PGR anq>lification 
uses primers complementary to tiie vector seqngnrj^s flflnlring tiia rPN A mspsrt. Array elemmts are 
amplified in thirty cycles of PCR ftom an initial quantity of 1-2 ng to a final quantity greater than 5 
fig. Amplified array elements are then purified nmng 5gKPTTAf!RyT.-4nfl (AfnptTshain Pharmacia 

35 Biotech). 
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Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone^ with extensive distilled water 
washes between and after tceatmi^Qts. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West CSbiester PA), washed extensively in distilled water, 
S and coated with 0.05% aminopropyl ^ane (Sigma) in 9S% ethanol. Coaled slides are cured in a 
110"Coven. 

Array elements are applied to tibe coated glass substrate using a procedmie described in U.S. 
Patent No. 5,807,522, mcorporated herein by refeimce. 1 fd of the array element DNA, at an 
average concentration of 100 ng/fil, is loaded into the open capillary printing element by a high- 

10 speed robotic apparatus. The apparatus then deposits about 5 nl of array elenaent sample p^ slide. 
Microarrays are UV-crosslinked using a STRATAUNKER UV-crosslhik^ (Stratagene). 
Microarrays are washed at room terqperature once m 0.2% SDS and three tunes in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein m phosphate 
buffeied saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 

15 0.2% SDS and distilled water as before. 
Hybridization - 

Hybridizaticm reactions contain 9 of sample mixture consisting of 0.2 fig each of Cy3 and 
CyS labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sanoqple 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microaxray surface and covered 

20 withan l.Scm^coverslip. Thearraysaretransfeixedtoawateiproof chamber having a cavity 
slightly larger than a micioscqpe slide. Thechamberiskeptat 100% humidity intornaUy by the 
addition of 140 fd of SX SSC in a comer of the chamber. Hie chamber containing the arrays is 
incubated for about 6.5 hours at 60** C The arrays are washed for 10 min at 45*" C in a first wash 
bufif^ (IX SSC, 0.1% SDS), three times for 10 minutes each at 4S*'C in a second wadi buffer (0.1X 

25 SSQ, and dried. 
Detection 

Reporter-labded hybridization conqilexes are detected .with a microscope equqyped with an 
Innova 70 mixed gas 10 W laser (Cohereiit» Inc. , Santa Clara CA) cqiable of generatmg q)ectral 
lines at 488 nm for excitation of Cy3 and at 632 nm for excitaticm of Cy 5 . The excitation laser ligjbt 
30 is focused (m die array uang a 20Xrnicxoscqpe objective (Nikon, Theslide 
containing the array is placed on a coo^uter-controlled X- Y stage on the microscope and raster- 
scanned past die objective. The 1.8 cm x 1.8 cm array used in the present exanq»le is scanned with 
a resolution of 20 miarometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
35 Bmitted light is spHt, based on wavelength, into two photonuiltiplier tube detectors CPMT R1477, 
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Hamamatsu Fhotomcs Systems, Bridgewater N3) conespondrng to ihe two fluQrophores. 
Appropiiate filteis positioned between the anay and die photomultiplier tabes are used to filter the 
signals. The emission maxima of the fluoiophoies used are 565 nm for Cy3 and 650 nm for Cy5. 
Each array is typically scanned twice, one scan per fluorc^hore using the appropriate filteis at the 
5 las^soun:e, although the ^aratus is capable of reconling the spectra from both & 
simultaneously. 

The sensitivity of die scans is typically calibrated using the signal intensity geaerated by a 
cDNA control qiedes added to the sanq>lenuxture at a Imownconcmtratio^ A specific location 
on the array contains a complementary DNA sequence, allowing the inteisity of the signal at tbat 

10 location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two 
samples from different sources (e.g., rqxres^iting test and control cells), each labeled with a 
differrait fluorophore, are hybridized to a single array for die purpose of identifying genes that are 
differmtially expressed, the calibradon is done by labelmg samples of the calibrating cDNA with 
the two fhiorophcxres and adding identical amounts of each to the hybridization mixture. 

15 The ou^ut of the photomultiplier tube is digitized usmg a 12-bit RTI-835H analog-to- 

digital (A/D) conversion board (Analog Devices, Inc. , Ncaivood MA) mstalled in an IBM- 
conq>atible PC computer. Thedigitizeddataaredisplayedasanimag^ where the signal mtensity is 
mapped using a linear 20^olor transfocmatim to a pseudocolor scale ranging fixsm blue Oow 
signal) to red (high signal). The data is also analyzed quantitatively. Where two different 

20 fluorophores are excited and measured simultaneously, the data are first ccmected for optical 
crosstalk (due to ov^lappmg emission q)ectra) between the fluorophores usmg each fluorophore's 
emission spectninL 

A grid is superimposed over the fluorescence signal image such that the signal fiom each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 

25 integrated to obtain a nurnericalvahieconiesponding to die average int^osity of the The 
software used far signal analysis is the C^MTOOLS gene expression analysis prpgram (Ihcyte). 
XL Complementary Polynudeotides 

Sequences complementary to the KAP-^icoding sequraces, or any parts thereot are used to 
detect, decrease, or inhibit expression of naturally occurring KAP. Although use of 

30 oligonucleotides conqnising fix>m about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of KAP. To 
inhibit transcription, a conqilenn^tary oligonucleotide is designed from the most unique 5* sequence 
and used to prevent promoter binding to the coding sequence. To inhibit translation, a 

35 complementary oligonucleotide is designed to prevent ribosomal binding to the KAP-encoding 
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transcript 

Xn. Expression of KAP 

Ebqxiession and purification of KAP is achieved using bacterial or virus-based expression 
systems. For expression of KAP in bacteria, cDNA is subcloned into an appropriate vector 

5 containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Exanq>les of such promoters include, but are not limited to, the trp-lac (toe) hybrid 
promoter and the TS or T7 bactmcq)hage promoter in conjunction with the lac operator regulatory 
element Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express KAP upon induction with isopropyl beta-D- 

10 thiogalactopyranoside CDTG). Expression of KAP in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Auto^r^hica calif omica nuclear polyhedrosis 
virus (AcMNPV), commonly known as baculovirus. The nonessratial polyhedrin gene of 
baculovirus is replaced with cDNA encoding KAP by either homologous recombination or 

^ bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 

15 maintained and the strong polyhedrin promoter drives Mgjh levels of cDNA transcription. 

Recombmant baculovirus is used to infect Spodopterafiueiperda (Sf9) insect cells in most cases, or 
human h^atocytes, in some cases. Infection of the latter requires additional genetic modifications 
to baculovirus. (See Engelhard, RK. et aL (1994) Ftoc. Nati. Acad. Sci. USA 91:3224-3227; 
Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

20 In most ex|HDession systems, KAP is synthesized as a fusion protein with, e.g., glutathime S- 

transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, 8ingl&-step, 
affinity-based purification of recombinant fusion protein fiom crude ceU lysates. GST, a 26- 
kilodalton enzynK^ from Schistosoma iaponicmm enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Ans^sham 

25 Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved fiom 
KAP at specifically engjmeered sites. FLAG, an S-amino acid peptide, enables immnnoaffinity 
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies 
(Eastman Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on 
metal-cbelate resins (QIAl^N). Methods for protein expression and purification are discussed in 

30 Ausubel (1995, supra, ch. 10 and 16). Purified KAP obtained by these methods can be used directiy 
in the assays shown in Exan^les XVI, XVn, XVHI, XIX, XX, and XXI where applicable. 
XIII. Fundicmal Assays 

KAP function is assessed by e3q>ressing the sequences encoding KAP at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian 

35 expression vector containing a strong promoter that drives high levels of cDNA expression. Vectors 
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of choice include PCMV SPORT (Life Technologies) and FOG. 1 (hivitiogen, Carlsbad CA), both 
of which ccmtain the cytomegalovirus promote. 5-10 of leconibinant vector are transiently 
transfected into a human cell line« for exanople, an endothelial or hematopoietic cell line, using 
either liposome fonnulations or electn^oration. 1-2 jug of an additional plasmid containing 
5 sequences encoding a morkn protein are co-tiansfected. Expression of a marker protein provides a 
means to distinguish transfected cells £rom nontransfected cells and is a reliable predictor of cDNA 
egression frran the recombinant vector. Maiker proteins of choice mclude, e.g., Qreea Fluorescent 
Protein (GFP; Qontech), CD64, or a CD64-GFP fusion protein. How cytometry (FCM), an 
automated, las^ qptics-based technique, is used to identify transfected cells e:q>re5sing GFP or 

10 CD64-OFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM 
detects and quantifies the iq>take of fluorescent molecules that diagnose events preceding or 
coinddent with cell death. Iliese events include changes in nuclear DNA omt^t as measuied by 
staining of DNA with prppidium iodide; changes in ceU size and granularity as measured by forward 
light scatty and 90 degree side li^ scatty, down-regulation of DNA synthesis as measured by 

15 decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 
proteins as measured by reactivity witii specific antibodies; and alterations in plasma membrane 
composition as noeasuied by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry arc discussed in Qrmerod, M.G. (1994) FInw ny tniiifttrY^ 
Oxford, New York NY. 

20 The influence of KAP on gene expression can be assessed using highly purified populations 

of ceUs transfected with sequences MicodingKAP and either 0064 or aD64<^ CD64and 
CD64-GET are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G OgG). Transfected cells are efBdentiy separated from nontransfected cells 
using magnetic beads coated with either human IgG or antibody against CD64 (D YNAL, Lake 

25 Success NY). mRNAcanbepurifiedfromthecellsusingiriethodsweU known by those of skill in 
the art Expression of mRNA encoding KAP and other g^es of interest can be analyzed by 
northern analysis or microarray techniques. 
XIV. Piroducticm of KAP Specific Antibodies 

KAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

30 Harrington, M.G. (1990) Metiiods EnzymoL 182:488^95), or other purification techniques, is used 
to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the KAP ammo acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of sldOUi in the art Methods for 

35 selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are 



91 



wo 02/46384 



PCT/USOl/47431 



well described in the art. (See» e.g., Ausubel, 1995, supra, ch. 1 1.) 

Typically, oligopeptides of about IS residues in length are synthesized using an ABI 431A 
peptide synthesizer (Applied Biosystems) using FMOC chendstcy and coupled to KLH (Sigma- 
Aldrich, St Louis MO) by reaction with N-naaleiniidobenzoyl-N4iydroxysuccinimide ester (MBS) 
5 to increase inununogenicity. (See, e.g., Ausubel, 1995, supra. ) Rabbits are innnunized with the 
oligopeptide-KLH conqilex in coiiq>lete Hieund^ adjuvant Resulting antisera are tested for 
antipeptide and anti-KAP activity by, for example, binding the peptide or KAP to a substrate, 
blocking with 1% BSA, reacting with rabbit antism, washmg, and reacting with radio-iodinated 
goat anti-rabbit IgG. 

10 XV. Purification of NataraDy Occurrmg KAP Vsiog Specific Antibodies 

Naturally occuning or recombinant KAP is substantially purified by inununoafEmity 
cbroniatography using antibodies specific frar KAP. An immimoaEBnitycolunm is constructed by 
covalentiy coiq>ling anti-KAP antibody to an activated chromatographic resin, such as 
C3®r-activated SEPHAROSB (Amersham Pharmacia Biotech). After the coiq>ling, the resin is 

15 blocked and washed according to the manu&cturer^ instractions. 

Media containing ElAP are passed over tiie immunoafBnity column, and the column is 
washed und^ conditioos tiuit allow the piefer^al absorbance of KAP (e.g., high ionic strength 
buffers in the presence of deterg^t). The column is eluted under conditions that disnq>t 
antibody/KAP famding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrqpe, such 

20 as urea or tfaiocyanale ion), and KAP is collected. 

XVI. Identification of Molecules Which Interact with KAP 

KAP, or biologically active fragments thereof, are labeled with Bolton-Honter reagent 
(See, e.g., Bolton, AB. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously anayed in the wells of a multi-well plate are incubated with tte labeled KAP, washed, 

25 and any wells with labeled KAP conq>lex are assayed. Data obtained using different concentrations 
of KAP are used to calculate values for the number, afBnity, and association of KAP with the 
candidate molecules. 

Altmiatively, molecules interacting with KAP are analyzed using the yeast two-hybrid 
system as described in Helds, S. and O. Song (1989) Nature 340:245-246, or using commerdaUy 
30 available kits based an the two-hybrid system, such as the MATCHMAKER system (Qontech). 

KAP may also be used in tiie P ATHCALLING process (CuraGen Corp. , New Havm CT) 
which employs the yeast two-hybrid system in a high-throughput manner to detemoine all 
interactions betwe^ the proteins encoded by two laige libraries of genes (Nandabalan, K. et al. 
(2000) U,S. Patent No. 6.057,101). 
35XVIL Demonstration of KAP Activity 
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Generally, protein kinase activity is measured by quantifying the phosphorylatiQn of a 
protrin substrate by KAP in the presence of [y-^] ATP. KAP is incubated with the protem 
substrate, ^-ATP, and an appropriate kinase buffer. The ^ incorporated into the substrate is 
separated from ftee ^-ATP by electrophoresis and flie incarporated ^ is counted usmg a 
5 radioisotc^ counter. Tlieaniountof incorporated^ is proportional to the activity of KAP. A 
determination of the specific amino acid residue phosphorylated is made by phosphoamino acid 
analysis of the hydrolyzed protein. 

hi one alternative, protein kinase activity is measured by quantifying the transfer of gamma 
phosphate from adenosine triphosphate (ATP) to a serine, threonine or tyrosine residue in a protein 

10 substrate. The reaction occurs between a protein kinase sample with a Inotinylated pqptide substrate 
and gamma ^-ATP. Following the reaction, free avidin in solution is added for bmding to the 
biotinylated ^-peptide product The Innding sanqple then undeigoes a centrifugal ultrajSltiadon 
process with a membrane which will retain the product-^vidin comqplex and allow passage of free 
gamma^-ATP. Thereservoirof the centrifuged unit contaiiiing the ^-peptide product as 

15 retentate is then counted in a scintillation counter. This procedure allows the assay of any type of 
protein kmasesanq>le,dq)endmg on die peptide substrate and kinase react! This 
assay is provided in kit form (ASUA, Affinity Ultrafiltration Separation Assay, Transbio 
CoipQration,Baltim(»eMD,U.S. Patent Na 5,869,275). Suggested substrates and thek respective 
enzymes include but are not limited to: Histone HI (Sigma) and p34'''''%inase, Aimexin I, 

20 Angiotensin (Sigma) and EGF receptor kmase, Annexin n and src kinase, ERKl & ERK2 substrates 
and MEK« and myelin basic protein and ERK OE^earson, J.D. et al. (1991) Methods Enzymol. 
200:62-81). 

hi another alternative, protein kinase activity of KAP is demonstrated in an assay containing 
KAP, SO (J of kinase buffer, 1 ^g substrate, such as myelin basic protein (MBP) synthetic 

25 peptide substrates, ImM DTI, 10 (igAlP, and 0 J )iQ[Y^^]ATP. The reaction is incubated at 
SO^^C for 30 rnmutes and stopped by pQ)ettu]g onto P81 paper. The unmccxporated [y-^] ATP is 
removed by washing and the inccxporated radioactivity is measured using a scmtiHation counter. 
Alternatively, the reaction is stopped by heating to lOO'^C in the tnresmce of SDS loading buffer and 
resolved on a 12% SDS polyacrylamide gel followed by autoradiography. The amount of 

30 incorporated ^ is proportional to Ae activity of KAP. 

In yet another alternative, adenylate kinase or guanylate kmase activity of KAP may be 
measured by die incoqKtration of ^ from [y-^] ATP mto ADP or GDP using a gamma 
radioisotope counter. KAP, in a kinase buffer, is incubated together with the appnqxriate nucleotide 
mono-phosphate substrate (AMP or GMP) and ^-labeled ATP as the phosphate donor. The 

35 reaction is mcubated at ST'C and tftrminntftH by addition of trichloroacetic add. The acid extract is 
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neutralized and subjected to gel electropboiesis to separate the mono-, di-, and tnphosphonucleotide 
fractions. The diphosphonucleotide fraction is excised and counted. The radioactivity lecoveied is 
proportional to the activity of KAP. 

Jn yet another alternative, other assays for KAP include scintillation proximity assays 
S (SPA), scintillatioii plate technology and filter binding assays. Useful substrates include 
recombinant proteins tagged with glutathione transferase, or synthetic peptide substrates tagged 
with biotin. Lohibitors of KAP activity, such as small organic molecules, proteins or peptides, may 
be identified by such assays. 

In another alternative, phosphatase activity of KAP is measured by the hydrolysis of para- 

10 nitrophenyl phosphate (PNPP). KAP is incubated together with PNPP in HEH3S buffer pH 7.5, in 
the presence of 0.1% p-mercq>toethanoI at 3T*C for 60 min. The reaction is stopped by the addition 
of 6 ml of 10 N NaOH CWamond, ILH. et al. (1994) Mol. Cell. Biol. 14:3752-62). Alternatively, 
acid phosphatase activity of KAP is demonstrated by incubating KAP-containing extract v^th 100 
fil of 10 mM PNPP in 0. 1 M sodium citrate, pH 4 J, and 50 fil of 40 mM NaQ at 37 for 20 min. 

15 The reaction is stqsped by the addition of 0.5 ml of 0.4 M gjiycme^aOH, pH 10.4 (Safdg, P. ^ al. 
(1997) J. Biol. Ca^m 272:18628*18635). The increase in light absorbance at 410 nm resulting fiom 
the hydrolyds of PNPP is measured using a spectrophotometer. The increase in li^ absorbance is 
proportional to tihe activity of KAP in the assay.^ 

In the altemative, KAP activity is determined by irosming tiie amount of phosphate 

20 removed firom a phosphorylatedprotem substrate. Reactions are perfonned with 2 (» 4 nM KAP in 
a final volume of 30 iil containing 60 mM Tris, pH 7.6, 1 mM EDTA, 1 mM EGTA, 0. 1 % 
^mercaptoethanol and 10 fjM substrate, ^-labeled on serineAfareonine or tyrosine, as ai^iropriate. 
Reactions are mitiated with substrate and incubated at 30° C for 10-15 mm. Reactions are quenched 
witii 450 nl of 4% (wN) activated charcoal in 0.6 M HQ, 90 mM NaJPzO?* and 2 mM NaH2p04, 

25 then centrifuged at 12,000 xg for 5 min. Acid-soluble ^ is quantified by liquid sdntillation 
counting (Sinclair, C. et al. (1999) J. Biol. Chem. 274:23666-23672). 
XVm. Kinase Binding Assay 

Binding of KAP to a FLAG-CD44 cyt fusion protein can be determined by mcubating KAP 
with anti-KAP-conjugated immunoafBnity beads followed by incubating portions of the beads 

30 (having 10-20 ng of protem) with 0.5 ml of a binding buffer (20 mM Tris-HCL (pH 7.4), 150 mM 
NaQ. 0.1% bovine serum albumin, and 0.05% Triton X-100) in tiie presence of ^^-labeled FLAG- 
CD44cyt fusion protein (5,000 cpmfng protein ) at 4 for 5 hours. Following binding, beads were 
washed thoroughly in the binding buffer and the bead-bound radioactivity measured in a 
scintillation counter (Bomgnignon, L.Y.W. et al. (2001) J. Biol. Chem. 276:7327-7336). The 

35 amount of incorporated ^ is proportional to the amount of bound KAP. 
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XDL Identification of KAP Inhibitors 

Cooapounds to be tested aie airayed in the wells of a 384-well plate m vaiying 
concentrations along with an appropriate buffer and substrate, as described in the assays in Exanqile 
XVn. KAPactivity is measured for each weU and the abiUty of each conq)omid to 
5 activity can be det^mined, as weU as the dosenrespoiise kinetics. This assay could also be used to 
identify molecules which enhance KAP activity. 

XX. Identification of KAP Substrates 

A KAP "substrate-trapping*' assay takes advantage of the increased substrate affinity that 
may be conferred by certain inutadons in the FTP signature sequence of protein tyrosine 

10 phosphatases. KAP bearing these mutations fonn a stable complex with their substrate; this 
complex may be isolated biochemically. Site-directed mutagenesis of invariant residues in the FTP 
signature sequence in a clone encoding the catalytic domain of ElAP is performed using a method 
standard in the art or a coinmerdal kit, such as the MUTA-CkENBM^ For 
expression of KAP mutants in Escherichia colL DNA fragments containing die mutation axe 

15 exchanged with the corresponding wild-type sequence in an expression vector bearing the sequence 
encoding E[AP or a glutatUoneS-transferase(GST)-KAPfusi(»i protein. KAP mutants are 
expressed in E. cell and purified by chromatography. 

The expression vector is transfected into COSl or 293 cells via calcium phosphate-mediated 

^ transfection widi 20 fig of CsQ-^urified DNA per 10-cm dish of cells or 8 /Ag per 6-cm dish. Forty- 

20 ei^ hours after transfection, cells are stimulated with 100 ng/ml epidermal growth factor to 
increase tyrosine phosphorylation in cells, as the tyrosine kinase EGFR is abundant in COS cells. 
Cells are lysed in 50 mM Tris HQ, pH 7.5/5 mM EDTA/150 mM Naa/1% Triton X-100/5 mM 
iodoacetic acid/10 mM sodium phosphate/10 mM NaF/5 /ig/ml leupeptin/5 ng/wl aprotinin/l mM 
benza midine (1 ml per 10-cm dish, 0.5 ml per 6-cm dish). KAP is inmumoprecipitated from lysates 

25 with an appropriate antibody. GST-KAP fusion proteins are precipitated with 

ghitatUone-iSepharose, 4 ^g of mAb or 10 ^1 of beads respectively per mg of cell lysate. 
Conq)lexes can be visualized by PAGE ot further purified to identify substrate molecules (Hint, 
AJ. et al. (1997) Proc. Nad. Acad. Sci. USA 94:1680-1685). 

XXI. Enhancttiient/InhilMtk>n of Protein Kinase Activity 

30 Agonists or antagcxiists of KAP activation or inhibition may be tested using assays 

described in section XVIL Agonists cause an increase in KAP activity and antagonists cause a 
decrease in KAP activity. 

Various nKx&fications and variations of the described methods and systems of the mvention 
35 will be apparent to those skilled in the art without departmg from the scope and spirit of the 
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invention. Although the inventicm has been described in connection with obtain embodiments, it 
should be understood that the invration as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for canying out the invention 
which aie obvious to those skilled in molecular biology or related fields are intended to be within 
5 the scope of the following claims. 
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Annotation 


[Rattus norvegicus] protein tyrosine phosphatase TDM. Cao. L. et al. (1998) J. 
Biol. Chem. 273:21077-21083 


myotubularin related 1 [Homo sapiens], Kioschis, P. et al. (1998) Genomics 
54:256-266 


Protein phpsphatase 2C (PP2C) [Fagus sylvatica]. 1 


[Mus musculus] thiamin pyrophosphokinase. Nosaka, K. et al. (1999) J. Biol. 
Chem.274:34129-34133 


[Homo sapiens] protein kinase PAK5 1 


[Homo sapiens] MEK kinase 1. Xia, Y. et al. (1998) Genes Dev. 12:3369-3381 


[Mus sp.] serine/threonine- and tyrosine-specific protein kinase, Nekl=NIMA cell 
cycle regulator homolog. Letvidn,K..etal. (1992)EMBO J. 11:3521-3531 


l^ttus norvegicus] DLG6 alpha. Inagaki, H. et al. (1999) Biochem. Biophys. 
Res. Commun. 265:462-468 


protein kinase [Mus musculus]. (Walden J>.D. and Cowan,N.J. (1993) Mol. Cell. 
Biol. 13: 7625-7635) 


ribosomal S6 kinase [Homo sapiens]. (Zhao,Y. et al. (1995) Mol. Cell. Biol. 15: 
4353-4363) 


protein kinase [Mus musculus]. (KuengJP. et al. (1997) J. CeU Biol. 139: 1851- 
1859) 
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Nature 399: 323-329) 
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Analytical Methods and 
Databases 


BLASTJPRODOM 






PROFIT KSCAN 


BLAST J*RODOM 


BLASTJDOMO 


MOTIFS 


MOTIFS 


Signature Sequences, Domains and Motifs 


HYDROXYPYRUVATE REDUCTASE PLASMID 


OXnXJREDUCr ASE N ADP PROTEIN 
GLYCERATE KINASE. PD014236: K131-T357, 
T357-L520 


Eukaryotic protein kinase domain: F44-E276 


Transmembrane domain: D133-I161 N-tenninus is 
cytosolic. 


Protein kinases signatures and profile: T140-E198 


CASEIN KINASE I, OAMMA 1 ISOFORM EC 
2.7.1. OAMMA TRANSFERASE 


5BR1NE/THREONINB ATP BINDINQ 
MULTIOENB FAMILY PHOSPHORYLATION; 
PD049080: M1-N43, PD015080: F315-W379 


AIN: 

; V46-Y304; 
: V46-Y304; 
V46-Y304; 
; V46-V293 


Protein kinases ATP-binding region signature: 150- 
K73 


Serine/Threonine protein kinases active-site 
signature: L160-I172 


PROTEIN KINASE DOM 
DM00004|A5671I|46-303: 
pM00004|C56711|45-301: 
DM00004|B56711|48-303: 
DM00004|D56406|31-276; 


Potential 

Glycosylation Sites 




N370 N388 














Potential 

Phosphorylation Sites 




S96S124S150S229 
S373T14T137T199 
T2I4T258 T269 
T273 T355T411 
T454 
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TMAP 


1 


MOTIFS 


MOTIFS 1 


BLAST-DOMO 


Signature Sequences, Domains and Motifs 


Transmembrane domain: A868-A890; N-terminus is 
cytosolic 


Protein kinases signature: L284-F339 


Serine/Threonine protein kinases active-site 
signature:a05-D17 


Leucine zipper pattern: L826-L847 


Protein kinase domains: DM00004|A48084|98-348: 
V704-R943; DM00004|Q01389|1 176-1430: V704- 
T945; DM00004|Q10407|826-1084: V704-T945; 
DM00004|P41892|ll-249: L705. 
T946J)M00004|P51957|8-251: L187-R427. 
DM00004|P41892|ll-249: L187-V395, 
DM00004|Q05609|553-797: B186-C419 
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What is claimed is: 

1. An isolated polypeptide selected fnim the group consistii^ 

a) a polypeptide comprising an amino acid sequence selected from the groi^ 
5 consisting of SEQ ID NO: 1-20, 

b) a polypeptide coix9)rising a naturally occurring amino acid sequence at least 90% 
identical to an ammo add sequrace selected from die group consisting of SEQ ID 
NO: 1-20, 

c) a biologically active fragment of a polypeptide having an amino acid sequence 
10 selected from the group consisting of SEQ ID NO: 1-20, and 

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the groiqp consisting of SEQ ID NO:1-20. 

2. An isolated polypeptide of claim 1 comprising an amino add sequence sdected from the 
15 group consisting of SEQ ID NO: 1-20. 

3. An isolated polynucleotide aicoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polyp^tide of claim 2. 

20 

5. An isolated polynucleotide of claim 4 coiiq>rising a polynucleotide sequence selected 
from the groiq) consistmg of SEQ ID NO:21-40. 

6. A recombinant polynucleotide conq>rising a promoter sequence operably linked to a 
25 polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 

9. A method of producing a polypeptide of claim 1, the method conqnising: 

a) culturing a ceU under conditions suitable for expression of the polypeptide, wherein 
said cell is transfrsnned with a recombinant polynucleotide, and said recombinant 
polynucleotide conqxrises a promoter sequence operably linked to a polynucleotide 
35 encoding the polypeptide of claim 1, and 



141 



wo 02/4^84 



PCT/USOl/47431 



b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide conqirises an amino acid sequ^ce 
selected from the group consisting of SEQ ID NO: 1-20. 

5 

1 1. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected fiom the group consisting of: 

a) a polynucleotide conqxrising a polynucleotide sequence selected fraan the group 
10 consisting of SEQ ID NO:21-40. 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected firom the group consisting of 
SEQIDNO:21-40, 

c) a polynucleotide conq)lementaiy to a polynucleotide of a), 

IS d) a polynucleotide coiiq)lenaentaiy to a polynucleotide of b), and 

e) an SNA equivalent of a>d). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

20 

14. A method of detecting a target polynucleotide in a san^le, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe conq)iising at least 20 contiguous nucleotides 
conqnising a sequence compl^nentary to said target polynucleotide in the sample, 

25 and which probe specifically hybridizes to said target polynucleotide, under 

conditions whereby a hybridization conq>Iex is formed betwem said probe and said 
target polynucleotide or fi-agments thereof, and 

b) detecting the presence or absence of said hybridization conoplex, and, optionally, if 
present, the amount thereof . 



30 



IS. A method of claim 14, wherein the probe conoprises at least 60 contiguous nucleotides. 



16. A method of detecting a target polynucleotide in a sanq)le, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method coiiq)rising: 
3S a) amplifying said target polynucleotide or fragment thereof using polymerase 
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reaction amplification, and 
b) detecting tiie piessace or absence of said amplified taiget polynucleotide or 
fragment thereof, and, optionally, if present, die amount thereof. 

5 17. Acompodtioncon^risingapolypeptideof claim 1 and a phannaceutically acceptable 

excipient 

18. A conq)osition of claim 17, whmin the polypeptide conqxrises an amino acid sequence 
selected firom the group ccmsisting of SEQ ID NO; 1-20. 

10 

19. A method for treating a disease or condition associated with decreased expression of 
functional KAP, con^rising administering to a patient in need of such treatment the 
composition of claim 17. 



15 20. Amethodof screening a compound for effectiveness as an agonist of a polypeptide of 

. claim 1, the method conqnising: 

a) exposing a sample conqirising a polypeptide of claim 1 to a conq>ound, and 

b) detecting agonist activity in the sample. 

20 21 . A conq>osition con^rising an agonist compound identified by a method of claim 20 and 

a pharmaceutically acc^table excipient 

22. A method for treating a disease or condition associated with decreased expression of 
functional KAP, conqnising administering to a patient in need of such treatment a 

25 conq)ositionof claim21. 

23. A method of screening a conq>ound for effectivraess as an antagonist of a polypeptide 
of claim 1, the method coniprising: 

a) e>q)osing a saiiq>le con^rising a polypeptide of claim 1 to a conqx}und, and 
30 b) detecting antagonist activity in the sample. 

24. A con^position comprising an antagonist conq)Ound identified by a method of claim 23 
aixi a pharmaceutically acceptable excipient 

35 25. A method for treating a disease or condition associated with overexpression of 
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functional KAP, comprising adnumstermg to a patient in need of such tteatnrat a 
con:q)osition of claim 24. 



26. Aniethodof screening for a compound tbatspecificaUy binds to tbej^^ 
5 claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test conq)ound under 
suitable conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test conq)ound, thereby 
identifying a conopound diat q)ecifically Innds to tibe polyp^tide of claim 1. 

10 

27. Amethodof screening for a compound that noodulates the activity of die poly^ 
claim 1, the method conq)rising: 

a) combining the polypeptide of claim 1 with at least one test conQ)ound under 
conditions permissive for the activity of the polypeptide of claim 1, 
15 b) assessing the activity of the polypeptide of claim 1 in the presence of the test 

compound, atid 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compoimd with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 

20 presence of the test conq>ound is indicative of a conq>ound that modulates the 

activity of the polypeptide of claim 1. 

28. A method of screening a con^und for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide conq)rises a sequence of claim 5, the 

25 method comprising: 

a) e7q>osing a sanq)le conqnisiag the target polynucleotide to a conq>ound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) conoparing the expression of the target polynucleotide in the presence of varying 
30 amounts of the conqmund and in the absence of the conq)ound. 

29. Amethodof assessing toxidty of a test compound, the nietfaod comprising: 

a) treating a biological sample containing nucleic acids with the test conq)ound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe 

35 coniprising at least 20 contiguous nucleotides of a polynucleotide of claim 12 und^ 
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conditions whereby a specific hybridization complex is formed between said probe 
and a target polynucleotide in the biological san5>le, said target polynucleotide 
conq)rising a polynucleotide sequence of a polynucleotide of claim 12 or fragment 
thereof, 

5 c) quantifying the amount of hybridization coiiq)lex, and 

d) conq)armg the amount of hybridization complex in the treated biological sanq)le 
with the amount of hybridization con:q>lex in an untreated biological sanq>le, 
wherein a difference in tl^ amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test con:q>ound. 

10 

30. A diagnostic test for a omdition or disease assodated with the ^qpression of KAP in a 
biological sample* the method conq)rising: 

a) combimng the biological sample with an antibody of claim 1 1, under conditions 
suitable for the antibody to bind the polypeptide and form an antibodyipolypeptide 

15 conqplex, and 

b) detecting the complex, wherein the presrace of the complex conelates with the 
presence of the polypq>tide in the biological sanq)le. 

31. Theantibodyofclaim 11, wherein the antibody is: 
20 a) a chimeric antibody, 

b) a single chain antibody, 

c) aFabfragmrait, 

d) a F(ab')2 fiagmrait, or 

e) a humanized antibody. 

25 

32. A composition cra^nising an antibody of claim 11 and an acceptable excqxient 

33. Ainetiiodof diagnosing a conditian or disease associated with die e3q;»^ 
in a subject, comprising administering to said subject an ^ective amount of the 

30 conxpositicm of claim 32. 

34. A composition of claim 32, wherein the antibody is labeled. 



35 



35. A method of diagnosmg a condition or disease associated with the esqpression of KAP 
in a subject, conqnising administering to said subject an effective amount of the 
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conq>ositioii of claim 34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of 
claim 1 1, the method cooogprising: 

5 a) immunizing an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-20, or an immunogenic 
fragment thereof, under conditions to elicit an antibocfy response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifymg a 

10 polyclonal antibody which binds specifically to a polypeptide conoprising an amino 

acid sequence selected from the group consisting of SEQ ID NO: 1-20. 

37. A polyclonal antibody produced by a method of claim 36. 

15 38. A conqyositiQn cawpiising the polyclonal antibody of claim 37 and a suitable canier. 

39. Ametfaodofmakingamonoclonalantibody withtfaespedfidty oftfaeantib^ 
claim 11, the mediod conqirising: 

a) immunizing an animal with a polypeptide consisting, of an amino acid sequence 
20 selected from the group consisting of SEQ ID NO: l-20» or an immunog^c 

fragment thereof, under conditians to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing Il» antibody producing cells with immortalized cells to form monoclonal 
antibo^-fxroducing hybridoma cells, 

25 d) cultucing the hybridoma cells, and 

e) isolating from the culture mcmoclonal antibocfy which binds specifically to a 

polypeptide conopising an amino add sequence selected fiDm the group consisting 
ofSBQIDNO:l-20. 

30 40. Amcmoclonalantibodyivoducedby a method of claim 39. 

41. A can9K)6itiQn conqnising the monoclcmal antibody of claim 40 and a suitable carrier. 



35 



42. The antibody of daim 11, wherein the antibody is produced by scre^iing a Fab 
expression library. 
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43. The antibody of claim 1 1, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequwice selected from 
5 the group consisting of SEQ ID NO: 1-20 in a saii9>le, the method comprising: 

a) incubating the antibody of claim 1 1 with a sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide conqnising an amino add sequence selected from the group consisting 

10 of SEQ ID NO: 1-20 in die sample. 

45. A method of purifying a polypeptide conqnising an amino acid sequence selected from 
Ae group consisting of SEQ ID NO: 1-20 from a sanq)le, the method con5)rising: 

a) incubating the antibody of claim 1 1 with a sanq>le under conditions to allow 
15 specific binding of the antibody and the polypeptide, and 

b) separating the antibody fmm the sample and obtaining the purified polypeptide 
couD^rising an amino add sequence selected from the group consisting of SEQ ID 
Nai-20. 

20 46. A microairay wherein at least one element of the nmcroarray is a polynucleotide of 

claim 13. 

47. AmetiK>dof generating an expression profile of a sanq>le which contains 
polynucleotidies, the method conqxrismg: 

25 a) labeling the polynucleotides of tiiesanq>le, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 
hybridization con^lex, and 

c) quantifymg the e^qpression of the polynucleotides in the sample. 

30 

48. An array conqxridng di£f«rent nucleotide molecules a£Bxed in distinct physical 
locations on a solid substrate, wb«ein at least one of said nucleotide molecules conqirises a 
first oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 
contiguous nucleotides of a target polynucleotide, and wherein said target polynucleotide is 

35 a polynucleotide of claim 12. 
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49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely con^lementary to at least 30 contiguous nucleotides of said target 
polynucleotide. 

5 50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely conq)lementary to at least 60 contiguous nucleotides of said target 
polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
10 completely conq>lementary to said target polynucleotide. 

52. Ananay of claim48, whichisamicroarray. 

53. An array of claim 48, further conqnising said target polynucleotide hybridized to a 
15 nucleotide molecule coiiq>rising said first oligonucleotide or polynucleotide sequence. 

54. Anatray of claim 48, whearein a linker joms at least one of said nucleotide m 
said solid substrate. 

20 55. Anarxay of claim 48, wherein each distinct physical location on the substrate contains 

multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct 
physical location have the same sequence, and each distinct physical location on the 
substrate contains nucleotide molecules having a sequence which differs fixim the sequoice 
of nucleotide molecules at another distuK^t physical location on the substrate. 

25 

56. A polypeptide of claim 1, comprismg the anoino acid sequence of SEQ ID NO:l. 

57. Apolypeptideof claim l,coinprising the amino add sequence of SEQ ID NO:2. 
30 58. Apolypq>tideof claim l,comprisiiig the amino acid sequence of SEQ ID N0:3. 

59. A polypeptide of claim 1, conprising the ammo acid sequence of SEQ ID NO:4. 



60. A polypeptide of clahn 1, comprising the amino acid sequence of SEQ ID N0:5. 

35 
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61. A polypeptide of claim 1, conqnising the amino acid sequence of SEQ ID N0:6. 
6Z A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 
5 63. A polypq)tide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, conqmsing the amino acid sequence of SEQ ID NO: 10: 

10 

66. A polypeptide of claim 1» conqnising the amino acid sequence of SEQ ID NO:ll. 

67. A polypeptide of claim I, conqnising the amino add sequ^ice of SEQ ID NO: 12. 
15 68. A polypeptide of claim 1, conq)rising the amino acid sequence of SEQ ID NO: 13. 

69. Apolypeptide of claim 1, comprising the amino add sequence of SEQ ID NO:14. 

70. A polypeptide of claim 1, conqirising the amino add sequmce of SEQ ID NO: 15. 

20 

71. A polypeptide ofclaiml,coiiqzrisingttie amino acid sequent 

72. A polypq>tide of claim l^conqmsing the amino add sequence of SEQ 

25 73. Apolyp^tideofclaiml,coiiqHrisingtheaminoaddsequeQceofSEQn>NO:18^ 

74. A polypeptide ofclaiml,coiiq)ridng the amino add sequence ofSEQ ID NO: 

75. A polypeptide of claun 1» comprising the amino add sequmce of SEQ ID NO:20. 

30 



76. A polynucleotide of claim 12, comprismg the polynudeotide sequence of SEQ ID 

N0:21. 

35 
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77. A polynucleotide of claim 12, conqnising the polyniKleotide sequence of SEQ ID 
NO:22. 

78. A polynucleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
5 Na23. 

79. A polynucleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
NO:24. 

10 80. A polynudeotide of claim 12, comiHising the polynucleotide sequrace of SEQ ID 

NO:25. 

81. A polynucleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
NO:26. 

15 

82. A polynucleotide of claim 12, conqnising the polynucleotide sequeiK» of SEQ ID 
NO:27. 

83. A polynudeodde of claim 12, conqnising the polynucleotide sequence of SEQ ID 
20 NO:28. 

84. A polynucleotide of claim 12, am^rising the polynucleotide sequence of SEQ ID 
NO:29. 

25 85. Apc^rnudeotideof claim 12, OHiqnising die polynucleotide sequence of SEQ ID 

NO:30. 

86. A polynucleotide of claim 12, conqnising the polynucleotide seqoraoce of SEQ ID 
N0:31. 

30 

87. A pdynudeotide of claim 12, conqnising the polynucleotide seqa«ice of SEQ ID 
NO:32. 

88. A pdynudeotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
35 NO:33. 

ISO 
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89. A polynucleotide of claim 12, coII^nisi]lg the polynucleotide sequence of SEQ ID 
NO:34. 

90. A polynucleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
5 NO:35. 

91. A polynucleotide of claim 12, conqnrising the polynucleotide sequence of SEQ ID 
NO:36. 

10 92. A polynucleotide of claim 12, con^rismg the polynucleotide sequence of SEQ ID 

NO:37. 

93. A polynucleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
NO:38. 

15 

94. A polynucleotide of claim 12, conqtrising the polynucleotide sequence of SEQ ID 
NO:39. 

95. A polymicleotide of claim 12, conqnising the polynucleotide sequence of SEQ ID 
20 Na40. 
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Ala Val Pro Val Thr Trp Thr Glu He Phe Ser 61y Lys Ser Val 
95 100 105 

Ala His Glu Asp He Lys Tyr Glu Gin Ala Cys He Leu Tyr Asn 

110 115 120 

Leu Gly Ala Leu His Ser Met Leu Gly Ala Met Asp Lys Arg Val 

125 130 135 

Ser Glu Glu Gly Met Lys Val Ser Cys Thr His Phe Gin Cys Ala 

140 145 150 

Ala Gly Ala Phe Ala Tyr Leu Arg Glu His Phe Pro Gin Ala Tyr 

155 160 165 

Ser Val Asp Met Ser Arg Gin He Leu Thr Leu Asn Val Asn Leu 

170 175 180 

Met Leu Gly Gin Ala Gin Glu Cys Leu Leu Glu Lys Ser Mfet Leu 

185 190 195 

Asp Asn Arg Lys Ser Phe Leu Val Ala Arg He Ser Ala Gin Val 

200 205 210 

Val Asp Tyr Tyr Lys Glu Ala Cys Arg Ala Leu Glu Asn Pro Asp 

215 220 225 

Thr Rla Ser Leu Leu Gly Arg He Gin Lys Asp Trp Lys Lys Leu 

230 235 240 

Val Gin Met Lys He Tyr Tyr Phe Ala Ala Val Ala His Leu His 

245 250 255 

Met Gly Lys Gin Ala Glu Glu Gin Gin Lys Phe Gly Glu Ar^ Val 

260 265 270 

Ala Tyr Phe Gin Ser Ala Leu Asp Lys Leu Asn Glu Ala He Lys 

275 280 • 285 

Leu Ala Lys Gly Gin Pro Asp Thr Val Gin Asp Ala LeU Arg Phe 

290 295 300 

Thr Met Asp Val He Gly Gly Lys 'Vyr Asn Ser Ala Lys Lys Asp 

305 310 315 

Asn Asp Phe He Tyr His Glu Ala Val Pro Ala Leu Asp Thr Leu 

320 325 330 

Gin Pro Val Lys Gly Ala Pro Leu Val Lys Pro Leu Pro Val Asn 

335 340 345 

Pro Thr Asp Pro Ala Val Thr Gly Pro Asp He Phe Ala Lys Leu 

350 355 360 

Val Pro Met Ala Ala His Glu Ala Ser Ser Leu Tyr Ser Glu Glu 

365 370 375 

Lys Ala Lys Leu Leu Arg Glu Met Met Ala Lys He Glu Asp Lys 

380 385 390 

Asn Glu Val Leu Asp Gin Phe Met Asp Ser Met Gin Leu Asp Pro 

395 400 405 

Glu Hhr Val Asp Asn Leu Asp Ala Tyr Ser His He Pro Pro Gin 

410 415 420 

Leu Met Glu Lys Cys Ala Ala Leu Ser Val Arg Pro Asp Thr Val 

425 430 435 

Arg Asn Leu Val Gin Ser Met Gin Val lieu Ser Gly Val Phe Thr 

440 445 450 

Asp Val Glu Ala Ser Leu Lys Asp He Arg Asp Leu Leu Glu Glu 

455 460 465 

Asp Glu Leu Leu Glu Gin Lys Phe Gin Glu Ala Val Gly Gin Ala 

470 475 480 

Gly Ala He Ser He Thr Ser Lys Ala Glu Leu Ala Glu Val Arg 

485 490 495 

Arg Glu Trp Ala Lys Tyr Met Glu Val His Glu Lys Ala Ser Phe 

500 505 510 

Thr Asn Ser Glu Leu His Arg Ala Met Asn Leu His Val Gly Asn 

515 520 525 

Leu Arg Leu Leu Ser Gly Pro Leu Asp Gin Val Arg Ala Ala Leu 

530 535 540 

Pro Thr Pro Ala Leu Ser Pro Glu Asp Lys Ala Val Leu Gin Asn 

545 550 555 

Leu Lys Arg He Leu Ala Lys Val Gin Glu Met Arg Asp Gin Arg 
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Asp Asp 
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uXU 
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Lys Lys Tyr Asp Gin Leu Lys 


Val Tyr 
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610 


615 


XiGU 


l3±U 




Asn 


Leu 


Ala 


Ala Gin Asp Arg Val Leu Cys 


Ala Leu 










620 




625 


630 


ThiT 


Glu 


AJLa 


Asn 


Val 


Gin 


Tyr Ala Ala Val Arg Arg Val 


Leu Ser 










635 




640 


645 


Asp 


Leu 


Asp 


Gin 


Lys 


Trp 


Asn Ser Thr Leu Gin Thr Leu 


Val Ala 










650 




655 


660 


Ser 


Tyr 


Glu 


Ala 


Tyr 


Glu 


Asp Leu Met Lys Lys Ser Gin 


Glu Gly 










665 




670 


675 


Arg 


Asp 


Phe 


Tyr 


Ala 


Asp 


Leu Glu Ser Lys Val Ala Ala 


Leu Leu 










680 




685 


690 


Glu 


Arg 


Thr 


Gin 


Ser 


Thr 


cys Gin Ala Arg Glu Ala Ala 


Arg Gin 










695 




700 
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Gin 


Leu 


Leu 


Asp 


Arg 


Glu 


Leu Lys Lys Lys Pro Pro Pro 


Arg Pro 










710 
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Thr 
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Pro 


Lys 


Pro 


Leu 


Leu Pro Arg Arg Glu Glu Ser 


Glu Ala 
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Val 


Glu 
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Gly 


Asp 


Pro 


Pro Glu Glu Leu Arg Ser Leu 


Pro Pro 
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Ala 


Gly 


Pro 
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Gly Ser 
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770 




775 
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Gly 


Pro 


Gly 


Pro 


His 


Tyr 


Leu Ser Gly Pro Leu Pro Pro 


Gly Thr 
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Tyr 


Ser 


Gly 


Pro 


Thr 
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Gly Pro 










800 




805 
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His 


Ala 


Met 


Pro 


Val 


Ala 


Pro Gly Pro Ala Leu Tyr Pro 


Ala Pro 
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820 
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Tyr 


Thr 


Pro 


Glu 
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Gly Leu Val Pro Arg Ser Ser 


Pro Gin 










830 
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840 


His 


Gly 


Val 


Val 


Ser 


Ser 


Pro Tyr Val Gly Val Gly Pro 


Ala Pro 
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850 


855 


Pro 


Val 
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Gly 


Leu 


Pro 


Ser Ala Pro Pro Pro Gin Phe 


Ser Gly 










860 




865 


870 


Pro 


Glu 


Leu 


Ala 


Met 
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Val Arg Pro Ala Thr Thr Thr 


Val Asp 










875 




880 
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Ser 


lie 


Gin 


Ala 


Pro 


Xle 


Pro Ser His Thr Ala Pro Arg 


Pro Asn 










890 
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Pro 


Thr 


Pro 


Ala 


Pro 


Pro 


Pro Pro Cys Phe Pro Val Pro 


Pro Pro 
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910 


915 


61x1 


Pro 


Leu 


Pro 


Thr 


Pro 


Tyr Thr Tyr Pro Ala Gly Ala 


Lys Gin 










920 




925 


930 


Pro 


lie 


Pro 


Ala 


Gin 


His 


His Phe Ser Ser Gly He Pro 


Thr Gly 
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940 
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Pne 


Pro 


Ala 


Pro 


Arg 


lie 


Gly Pro Gin Pro Gin Pro His 


Pro Gin 










950 




955 
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Pro 
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Pro 


Ser 


Gin 


Ala 


Phe Gly Pro Gin Pro Pro Gin 


Gin Pro 
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I>eu 


Pro 


Leu 


Gin 


His 
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Pro Gly 










980 
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Leu 
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Pro 


Pro 


Gin 
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Pro Tyr Pro Tyr Ala Pro Gin 


Pro Gly 










995 
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Val 
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Gly 


Gin 


Pro 


Pro 


Pro Pro Leu His Thr Gin Leu 


Tyr Pro 
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Gly 


Pro 
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Gin Asp 


Pro 


Leu Pro Ala His Ser Gly Ala 
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Phe Pro Ser Pro Gly Pro Pro Gin Pro Pro His Pro Pro Leu Ala 

1040 1045 1050 

Tyr Gly Pro Ala Pro Ser Thr Arg Pro Met Gly Pro Gin Ala Ala 

1055 1060 1065 

Pro Leu Thr lie Arg Gly Pro Ser Ser Ala Gly Gin Ser Thr Pro 

1070 1075 1080 

Ser Pro His Leu Val Pro Ser Pro Ala Pro Ser Pro Gly Pro Gly 

1085 1090 1095 

Pro Val Pro Pro Arg Pro Pro Ala Ala Glu Pro Pro Pro Cys Leu 

1100 1105 1110 

Arg Arg Gly Ala Ala Ala Ala Asp Leu Leu Ser Ser Ser Pro Glu 

1115 1120 1125 

Ser Gin His Gly Gly Thr Gin Ser Pro Gly Gly Gly Gin Pro Leu 

1130 1135 1140 

Leu Gin Pro Thr Lys Val Asp Ala Ala Glu Gly Arg Arg Pro Gin 

1145 1150 1155 

Ala Leu Arg Leu lie Glu Arg Asp Pro Tyr Glu His Pro Glu Arg 

1160 1165 1170 

Leu Arg Gin Leu Gin Gin Glu Leu Glu Ala Phe Arg Gly Gin Leu 

1175 1180 1185 

Gly Asp Val Gly Ala Leu Asp Thr Val Trp Arg Glu Leu Gin Asp 

1190 1195 1200 

Ala Gin Glu His Asp Ala Arg Gly Arg Ser lie Ala He Ala Arg 

1205 1210 1215 

Cys Tyr Ser Leu Lys Asn Arg His Gin Asp Val Met Pro Tyr Asp 

1220 1225 1230 

Ser Asn Arg Val Val Leu Arg Ser Gly Lys Asp Asp Tyr He Asn 

1235 1240 1245 

Ala Ser Cys Val Glu Gly Leu Ser Pro Tyr Cys Pro Pro Leu Val 

1250 1255 1260 

Ala Thr Gin Ala Pro Leu Pro Gly Thr Ala Ala Asp Phe Trp Leu 

1265 1270 1275 

Met Val His Glu Gin Lys Val Ser Val He Val Met Leu Val Ser 

1280 1285 1290 

Glu Ala Glu Met Glu Lys Gin Lys Val Ala Arg Tyr Phe Pro Thr 

1295 1300 1305 

Glu Arg Gly Gin Pro Met Val His Gly Ala Leu Ser Leu Ala Leu 

1310 1315 1320 

Ser Ser Val Arg Ser Thr Glu Thr His Val Glu Arg Val Leu Ser 

1325 1330 1335 

Leu Gin Phe Arg Asp Gin Ser Leu Lys Arg Ser Leu Val His Leu 

1340 1345 1350 

His Phe Pro Thr Trp Pro Glu Leu Gly Leu Pro Asp Ser Pro Ser 

1355 1360 1365 

Asn Leu Leu Arg Phe He Gin Glu Val His Ala His Tyr Leu His 

1370 1375 1380 

Gin Arg Pro Leu His Thr Pro He He Val His Cys Ser Ser Gly 

1385 1390 1395 

Val Gly Arg Thr Gly Ala Phe Ala Leu Leu Tyr Ala Ala Val Gin 

1400 1405 1410 

Glu Val Glu Ala Gly Asn Gly He Pro Glu Leu Pro Gin Leu Val 

1415 1420 1425 

Arg Arg Met Arg Gin Gin Arg Lys His Met Leu Gin Glu Lys Leu 

1430 1435 1440 

His Leu Arg Phe Cys Tyr Glu Ala Val Val Arg His Val Glu Gin 

1445 1450 1455 

Val Leu Gin Arg His Gly Val Pro Pro Pro Cys Lys Pro Leu Ala 

1460 1465 1470 

Ser Ala Ser He Ser Gin Lys Asn His Leu Pro Gin Asp Ser Gin 

1475 1480 1485 

Asp Leu Val Leu Gly Gly Asp Val Pro He Ser Ser He Gin Ala 

1490 1495 1500 

Thr He Ala Lys Leu Ser He Arg Pro Pro Gly Gly Leu Glu Ser 
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10 




15 


Gly 


Gly 


Pro 


Asn 


Pro 


Gly 


Pro 


Ala 


Gly 


Gly 


Arg Arg Pro Pro 


Arg 










20 










25 




30 


Ala 


Ala 


Gly 


Gly 


Ala 


Thr 


Ala 


Gly 


Ser 


Arg 


Gin Pro Ser Val 


Glu 










35 










40 




45 


Thr 


Leu 


Asp 


Ser 


Pro 


Thr 


Gly 


Ser 


His 


Val 


Glu Trp Cys Lys 


Gin 










50 










55 




60 


Leu 


lie 


Ala 


Ala 


Thr 


He 


Ser 


Ser 


Gin 


He 


Ser Gly Ser Val 


Thr 










65 










70 




75 


Ser 


Glu 


Asn 


Val 


Ser 


Arg 


Asp 


Tyr 


Lys 


Val 


Phe Arg Arg Pro 


Asp 










80 










85 




90 


Leu 


Arg 


Ala 


Leu 


Arg 


Asp 


Gly 


Asn 


Lys 


Leu 


Ala Gin Met Glu 


Glu 










95 










100 




105 


Ala 


Pro 


Leu 


Phe 


Pro 


Gly 


Glu 


Ser 


He 


Lys 


Ala He Val Lys 


Asp 










110 










115 




120 


Val 


Met 


Tyr 


He 


Cys 


Pro 


Phe 


Met 


Gly 


Ala 


Val Ser Gly Thr 


Leu 










125 










130 




135 


Thr 


Val 


Thr 


Asp 


Phe 


Lys 


Leu 


Tyr 


Phe 


Lys 


Asn Val Glu Arg 


Asp 










140 










145 




150 


Pro 


His 


Phe 


He 


I*eu 


Asp 


Val 


Pro 


Leu 


Gly 


Val He Ser Arg 


Val 










155 










160 




165 


Glu 


Lys 


He 


Gly 


Ala 


Gin 


Ser 


His 


Gly 


Asp 


Asn Ser Cys Gly 


He 










170 










175 




180 


Glu 


lie 


Val 


Cys 


Lys 


Asp 


Met 


Arg 


Asn 


Leu 


Tkrg Leu Ala Tyr 


Lys 










185 










190 


195 


Gin 


Glu 


Glu 


Gin 


Ser 


Lys 


Leu 


Gly 


He 


Phe 


Glu Asn Leu Asn 


Lys 










200 










205 




210 


His 


Ala 


Phe 


Pro 


Leu 


Ser 


Asn 


Gly 


Gin 


Ala 


Leu Phe Ala Phe 


Ser 










215 










220 




225 


Tyr 


Lys 


Glu 


Lys 


Phe 


Pro 


He 


Asn 


Gly 


Trp 


Lys Val Tyr Asp 


Pro 










230 










235 


240 


Val 


Ser 


Glu 


Tyr 


Lys 


Arg 


Gin 


Gly 


Leu 


Pro 


Asn Glu Ser Trp 


Lys 
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245 250 255 

lie Ser Lys lie Asn Ser Asn Tyr Glu Phe Cys Asp Thr Tyr Pro 

260 265 270 

Ala He lie Val Val Pro Thr Ser Val Lys Asp Asp Asp Leu Ser 

275 280 285 

Lys Val Ala Ala Phe Arg Ala Lys Gly Arg Val Pro Val Leu Ser 

290 295 300 

Trp He His Pro Glu Ser Gin Ala Thr He Thr Arg Cys Ser Gin 

305 310 315 

Pro Leu Val Gly Pro Asn Asp Lys Arg Cys Lys Glu Asp Glu Lys 

320 325 330 

Tyr Leu Gin Thr He Met Asp Ala Asn Ala Gin Ser His Lys Leu 

335 340 345 

He He Phe Asp Ala Arg Gin Asn Ser Val Ala Asp Thr Asn Lys 

350 355 360 

Thr Lys Gly Gly Gly Tyr Glu Ser Glu Ser Ala Tyr Pro Asn Ala 

365 370 375 

Glu Leu Val Phe Leu Glu He His Asn He His Val Met Arg Glu 

380 385 390 

Ser Leu Arg Lys Leu Lys Glu He Val Tyr Pro Ser He Asp Glu 

395 400 405 

Ala Arg Trp Leu Ser Asn Val Asp Gly Thr His Trp Leu Glu Tyr 

410 415 420 

He Arg Met Leu Leu Ala Gly Ala Val Arg He Ala Asp Lys He 

425 430 435 

Glu Ser Gly Lys Thr Ser Val Val Val His Cys Ser Asp Gly Trp 

440 445 450 

Asp Arg Thr Ala Gin Leu Thr Ser Leu Ala Met Leu Met Leu Asp 

455 460 465 

Ser Tyr Tyr Arg Thr He Lys Gly Phe Glu Thr Leu Val Glu Lys 

470 475 480 

Glu Trp He Ser Phe Gly His Arg Phe Ala Leu Arg Val Gly His 

485 490 495 

Gly Asn Asp Asn His Ala Asp Ala Asp Arg Ser Pro He Phe Leu 

500 505 510 

Gin Phe Val Asp Cys Val Trp Gin Met Thr Arg Gin Phe Pro Ser 

515 520 525 

Ala Phe Glu Phe Asn Glu Leu Phe Leu He Thr He Leu Asp His 

530 535 540 

Leu Tyr Ser Cys Leu Phe Gly Thr Phe Leu Cys Asn Cys Glu Gin 

545 550 555 

Gin Arg Phe Lys Glu Asp Val Tyr Thr Lys Thr He Ser Leu Trp 

560 565 570 

Ser Tyr He Asn Ser Gin Leu Asp Glu Phe Ser Asn Pro Phe Phe 

575 580 585 

Val Asn Tyr Glu Asn His Val Leu Tyr Pro Val Ala Ser Leu Ser 

590 595 600 

His Leu Glu Leu Trp Val Asn Tyr Tyr Val Arg Trp Asn Pro Arg 

605 610 615 

Met Arg Pro Gin Met Pro He His Gin Asn Leu Lys Glu Leu Leu 

620 625 . 630 

Ala Val Arg Ala Glu Leu Gin Lys Arg Val Glu Gly Leu Gin Arg 

635 640 645 

Glu Val Ala Thr Arg Ala Val Ser Ser Ser Ser Glu Arg Gly Ser 

650 655 660 

Ser Pro Ser His Ser Ala Thr Ser Val His Thr Ser Val 

665 670 

<210> 3 

<211> 459 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> mis cofeature 

<223> Incyte ID No: 72358203CD1 

<400> 3 

Met Ser Ala Gly Trp Phe Arg Arg Arg Phe Leu Pro Gly Glu Pro 
15 10 15 

Leu Pro Ala Pro Arg Pro Pro Gly Pro His Ala Ser Pro Val Pro 
20 25 30 

Tyr Arg Arg Pro Arg Phe Leu Arg Gly Ser Ser Ser Ser Pro Gly 
35 40 45 

Ala Ala Asp Ala Ser Arg Arg Pro Asp Ser Arg Pro Val Arg Ser 
50 55 60 

Pro Ala Arg Gly Arg Thr Leu Pro Trp Asn Ala Gly Tyr Ala Glu 
65 70 75 

He He Asn Ala Glu Lys Ser Glu Phe Asn Glu Asp Gin Ala Ala 
80 85 90 

Cys Gly Lys Leu Cys He Arg Arg Cys Glu Phe Gly Ala Glu Glu 
95 100 105 

Glu Trp Leu Thr Leu Cys Pro Glu Glu Phe Leu Thr Gly His Tyr 

110 115 120 

Trp Ala Leu Phe Asp Gly His Gly Gly Pro Ala Ala Ala He Leu 

125 130 135 

Ala Ala Asn Thr Leu His Ser Qys Leu Arg Arg Gin Leu Glu Ala 

140 145 150 

Val Val Glu Gly Leu Val Ala Thr Gin Pro Pro Met His Leu Asn 

155 160 165 

Gly Arg Cys He Cys Pro Ser Asp Pro Gin Phe Val Glu Glu Lys 

170 175 180 

Gly He Arg Ala Glu Asp Leu Val He Gly Ala Leu Glu Ser Ala 

185 190 195 

Phe Gin Glu Cys Asp Glu Val He Gly Arg Glu Leu Glu Ala Ser 

200 205 210 

Gly Gin Met Gly Gly Cys Thr Ala Leu Val Ala Val Ser Leu Gin 

215 220 225 

Gly Lys Leu Tyr Met Ala Asn Ala Gly Asp Ser Arg Ala He Leu 

230 235 240 

Val Arg Arg Asp Glu He Arg Pro Leu Ser Phe Glu Phe Thr Pro 

245 250 255 

Glu Thr Glu Arg Gin Arg He Gin Gin Leu Ala Phe Val Tyr Pro 

260 265 270 

Glu Leu Leu Ala Gly Glu Phe Thr Arg Leu Glu Phe Pro Arg Arg 

275 280 285 

Leu Lys Gly Asp Asp Leu Gly Gin Lys Val Leu Phe Arg Asp His 

290 295 300 

His Met Ser Gly Trp Ser Tyr Lys Arg Val Glu Lys Ser Asp Leu 

305 310 315 

Lys Tyr Pro Leu He His Gly Gin Gly Arg Gin Ala Arg Leu Leu 

320 325 330 

Gly Thr Leu Ala Val Ser Arg Gly Leu Gly Asp His Gin Leu Arg 

335 340 345 

Val Leu Asp Thr Asn He Gin Leu Lys Pro Phe Leu Leu Ser Val 

350 355 360 

Pro Gin Val Thr Val Leu Asp Val Asp Gin Leu Glu Leu Gin Glu 

365 370 375 

Asp Asp Val Val Val Met Ala Thr Asp Gly Leu Trp Asp Val Leu 

380 385 390 

Ser Asn Glu Gin Val Ala Trp Leu Val Arg Ser Phe Leu Pro Gly 

395 400 405 

Asn Gin Glu Asp Pro His Arg Phe Ser Lys Leu Ala Gin Met Leu 

410 415 420 

He His Ser Thr Gin Gly Lys Glu Asp Ser Leu Thr Glu Glu Gly 

425 .430 435 
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Gin Val Ser Tyr Asp Asp Val Ser Val Phe Val He Pro Leu His 
440 445 450 

Ser Gin Gly Gin Glu Ser Ser Asp His 
455 

<210> 4 
<211> 243 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 4885040CD1 

<400> 4 

Met Glu His Ala Phe Thr Pro Leu Glu Pro Leu Leu Ser Thr Gly 
1 5 10 15 

Asn Leu Lys Tyr qys Leu Val lie Leu Asn Gin Pro Leu Asp Asn 
20 25 30 

Tyr Phe Arg His Leu Trp Asn Lys Ala Leu Leu Arg Ala Cys Ala 
35 40 45 

Asp Gly Gly Ala Asn Arg Leu Tyr Asp He Thr Glu Gly Glu Arg 
50 55 60 

Glu Ser Phe Leu Pro Glu Phe He Asn Gly Asp Phe Asp Ser He 
65 70 75 

2trg Pro Glu Val Arg Glu Tyr Tyr Ala Thr Lys Gly Cys Glu Leu 
80 85 90 

He Ser Thr Pro Asp Gin Asp His Thr Asp Phe Thr Lys Cys Leu 
95 100 105 

Lys Met Leu Gin Lys Lys He Glu Glu Lys Asp Leu Lys Val Asp 
110 115 120 

Val He Val Thr Leu Gly Gly Leu Ala Gly Arg Phe Asp Gin He 
125 130 135 

Met Ala Ser Val Asn Thr Leu Phe Gin Ala Thr His He Thr Pro 
140 145 150 

Phe Pro He He He He Gin Glu Glu Ser Leu He Tyr Leu Leu 
155 160 165 

Gin Pro Gly Lys His Arg Leu His Val Asp Thr Gly Met Glu Gly 
170 175 180 

Asp Trp Cys Gly Leu He Pro Val Gly Gin Pro Qys Met Gin Val 
185 190 195 

Thr Thr Thr Gly Leu Lys Trp Asn Leu Thr Asn Asp Val Leu Ala 
200 205 210 

Phe Gly Thr Leu Val Ser Thr Ser Asn Thr Tyr Asp Gly Ser Gly 
215 220 225 

Val Val Thr Val Glu Thr Asp His Pro Leu Leu Trp Thr Met Ala 
230 235 240 

He Lys Ser 



<210> 5 
<211> 632 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^f eature 

<223> Incyte ID No: 7484507CD1 

<400> 5 

Met Leu Gly Pro Gly Ser Asn Arg Arg Arg Pro Thr Gin Gly Glu 
15 10 15 

Arg Gly Pro Gly Ser Pro Gly Glu Pro Met Glu Lys Tyr Gin Val 
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20 25 30 

Leu Tyr Gin lieu Asn Pro Gly Ala Leu Gly Val Asn Leu Val Val 
35 40 45 

Glu Glu Met Glu Thr Lys Val Lys His Val He Lys Gin Val Glu 
50 55 60 

Cys Met Asp Asp His Tyr Ala Ser Gin Ala Leu Glu Glu Leu Met 
65 70 75 

Pro Leu Leu Lys Leu Arg His Ala His He Ser Val Tyr Gin Glu 
80 85 90 

Leu Phe He Thr Trp Asn Gly Glu He Ser Ser Leu Tyr Leu Cys 
95 100 105 

Leu Val Met Glu Phe Asn Glu Leu Ser Phe Gin Glu Val He Glu 

110 115 120 

Asp Lys Arg Lys Ala Lys Lys He He Asp Ser Glu Trp Met Gin 

125 130 135 

Asn val Leu Gly Gin Val Leu Asp Ala Leu Glu Tyr Leu His His 

140 145 150 

Leu Asp He He His Arg I^sn Leu Lys Pro Ser Asn He He Leu 

155 160 165 

He Ser Ser Asp His Cys Lys Leu Gin Asp Leu Ser Ser Asn Val 

170 175 180 

Leu Met Thr Asp Lys Ala Lys Trp Asn He Arg Ala Glu Glu Asp 

185 190 195 

Pro Phe Arg Lys Ser Trp Met Ala Pro Glu Ala Leu Asn Phe Ser 

200 205 210 

Phe Ser Gin Lys Ser Asp He Trp Ser Leu Gly Qys He He Leu 

215 220 225 

Asp Met Thr Ser Cys Ser Phe Met Asp Gly Thr Glu Ala Met His 

230 235 240 

Leu Arg Lys Ser Leu Arg Gin Ser Pro Gly Ser Leu Lys Ala Val 

245 250 255 

Leu Lys Thr Met Glu Glu Lys Gin He Pro Asp Val Glu Thr Phe 

260 265 270 

Arg Asn Leu Leu Pro Leu Met Leu Gin He Asp Pro Ser Asp Arg 

275 280 285 

He Thr He Lys Asp Val Val His He Thr Phe Leu 3^g Gly Ser 

290 295 300 

Phe Lys Ser Ser Cys Val Ser Leu Thr Leu His Arg Gin Met Val 

305 310 315 

Pro Ala Ser He Thr Asp Met Leu Leu Glu Gly Asn Val Ala Ser 

320 325 330 

He Leu Gly Asp Ala Gly Asp Thr Lys Gly Glu Arg Ala Leu Lys 

335 340 345 

Leu Leu Ser Met Ala Leu Ala Ser Tyr Cys Leu Val Pro Glu Gly 

350 355 360 

Ser Leu Phe Met Pro Leu Ala Leu Leu His Met His Asp Gin Trp 

365 370 375 

Leu Ser Cys Asp Gin Asp Arg Val Pro Gly Lys Arg Asp Phe Ala 

380 385 390 

Ser Leu Gly Lys Leu Gly Lys Leu Leu Gly Pro He Pro Lys Gly 

395 400 405 

Leu Pro Trp Pro Pro Glu Leu Val Glu Val Val Val Thr Thr Met 

410 415 420 

Glu Leu His Asp Arg Val Leu Asp Val Gin Leu Cys Ala Cys Ser 

425 430 435 

Leu Leu Leu His Leu Leu Gly Gin Gly He He Val Asn Lys Ala 

440 445 450 

Pro Leu Glu Lys Val Pro Asp Leu He Ser Gin Val Leu Ala Thr 

455 460 465 

Tyr Pro Ala Asp Gly Glu Met Ala Glu Ala Ser Cys Gly Val Phe 

470 475 480 

Trp Leu Leu Ser Leu Leu Gly Cys He Lys Glu Gin Gin Phe Glu 

485 490 495 
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Gin 


Val 


Val Ala 


Leu Leu 








500 


Arg 


TQa 


Leu Leu 


Val Asn 








515 


Val 


Lys 


Val Ser 


Glu Leu 








530 


Glu 


Gly 


Gly Ser 


Gly Leu 








545 


His 


Arg 


Asp Asp 


Pro Glu 








560 


Val 


HiB 


Leu Ala 


Ser Tyr 








575 


Ser 


Ser 


Met Lys 


Ala Leu 








590 


Ser 


Ser 


Leu Glu 


Leu Val 








605 


lieu 


Glu 


Ala Ala 


Thr Ser 








620 


Gin 


Pro 







Leu Gin 
Asn Ala 
Ala Ala 
Ser Leu 
Val Val 
Glu Glu 
Leu Gin 
Ser Cys 
Pro Ser 



Ser 
Tyr 
Phe 
lie 
Glu 
lie 
Glu 
Ala 
Pro 



lie 
505 
Arg 
520 
Lys 
535 
Lys 
550 
T^n 
565 
Leu 
580 
lie 
595 
Glu 
610 
Leu 
625 



Arg Leu 
Gly Leu 
Val Val 
Glu Thr 
Val Gly 
Pro Glu 
Lys Glu 
Lys Val 
Gly Gly 



Cys Gin Asp 
510 

Ala Ser Leu 
525 

Val Gin Glu 
540 

Tyx Gin Leu 
555 

Met Leu Leu 
570 

Leu Val Ser 
585 

Arg Phe Thr 
600 

Leu Leu Arg 
615 

Glu Ala Ala 
630 



<210> 6 
<211> 1511 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7198931CD1 



<400> 6 

Met Ala Ala Ala Ala Gly Asn Arg Ala Ser Ser Ser Gly Phe Pro 
15 10 15 

Gly Ala Arg Ala Tlir Ser Pro Glu Ala Gly Gly Gly Gly Gly Ala 
20 25 30 

Leu Lys Ala Ser Ser Ala Arg Ala Ala Ala Ala Gly Leu Leu Arg 
35 40 45 

Glu Ala Gly Ser Gly Gly Arg Glu Arg Ala Asp Trp Arg Arg Arg 
50 55 60 

Gin Leu Arg Lys Val Arg Ser Val Glu Leu Asp Gin Leu Pro Glu 
65 70 75 

Gin Pro Leu Phe Leu Ala Ala Ser Pro Pro Ala Ser Ser Thr Ser 
80 85 90 

Pro Ser Pro Glu Pro Ala Asp Ala Ala Gly Ser Gly Thr Gly Phe 
95 100 105 

Gin Pro Val Ala Val Pro Pro Pro His Gly Ala Ala Ser Arg Arg 
110 115 120 

Gly Ala His Leu Thr Glu Ser Val Ala Ala Pro Asp Ser Gly Ala 
125 130 135 

Ser Ser Pro Ala Ala Ala Glu Pro Gly Glu Lys Arg Ala Pro Ala 
140 145 150 

Ala Glu Pro Ser Pro Ala Ala Ala Pro Ala Gly Arg Glu Met Glu 
155 160 165 

Asn Lys Glu Thr Leu Lys Gly Leu His Lys Met Asp Asp Arg Pro 
170 175 180 

Glu Glu Arg Mfet He Arg Glu Lys Leu Lys Ala Thr Cys Met Pro 
185 190 195 

Ala Trp Lys His Glu Trp Leu Glu Arg Arg Asn Arg Arg Gly Pro 
200 205 210 

Val Val Val Lys Pro He Pro Val Lys Gly Asp Gly Ser Glu Met 
215 220 225 

Asn His Leu Ala Ala Glu Ser Pro Gly Glu Val Gin Ala Ser Ala 
230 235 240 
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Ala Sex Pro Ala Ser Lys Gly Arg Arg Ser Pro Ser Pro Gly Asn 

245 250 255 

Ser Pro Ser Gly Arg Thr Val Lys Ser Glu Ser Pro Gly Val Arg 

260 265 270 

Arg Lys Arg Val Ser Pro Val Pro Phe Gin Ser Gly Arg lie Thr 

275 280 285 

Pro Pro Arg Arg Ala Pro Ser Pro Asp Gly Phe Ser Pro a*yr Ser 

290 295 300 

Pro Glu Glu Thr Asn Arg Arg Val Asn Lys Val Met Arg Ala Arg 

305 310 315 

Leu Tyr Leu Leu Gin Gin lie Gly Pro Asn Ser Phe Leu lie Gly 

320 325 330 

Gly Asp Ser Pro Asp Asn Lys Tyr Arg Val Phe lie Gly Pro Gin 

335 340 345 

Asn Cys Ser Cys Ala Aarg Gly Thr Phe Cys He His Leu Leu Phe 

350 355 360 

Val Met Leu Arg Val Phe Gin Leu Glu Pro Ser Asp Pro Met Leu 

365 370 375 

Trp Arg Lys Thr Leu Lys Asn Phe Glu Val Glu Ser Leu Phe Gin 

380 385 390 

Lys Tyr His Ser Arg Arg Ser Ser Arg He Lys Ala Pro Ser Arg 

395 400 405 

Asn Thr He Gin Lys Phe Val Ser Arg Met Ser Asn Ser His Thr 

410 415 420 

Leu Ser Ser Ser Ser Thr Ser Thr Ser Ser Ser Glu Asn Ser He 

425 430 435 

Lys Asp Glu Glu Glu Gin Met Cys Pro He Cys Leu Leu Gly Met 

440 445 450 

Leu Asp Glu Glu Ser Leu Thr Val Cys Glu Asp Gly Cys Arg Asn 

455 460 465 

Lys Leu His His His Cys Met Ser He Trp Ala Glu Glu Cys Arg 

470 475 480 

Arg Asn Arg Glu Pro Leu He Cys Pro Leu Cys Arg Ser Lys Trp 

485 490 495 

Arg Ser His Asp Phe Tyr Ser His Glu Leu Ser Ser Pro Val Asp 

500 505 510 

Ser Pro Ser Ser Leu Arg Ala Ala Gin Gin Gin Thr Val Gin Gin 

515 520 525 

Gin Pro Leu Ala Gly Ser Arg Arg Asn Gin Glu Ser Asn Phe Asn 

530 535 540 

Leu Thr His Tyr Gly Thr Gin Gin He Pro Pro Ala Tyr Lys Asp 

545 550 555 

Leu Ala Glu Pro Trp He Gin Val Phe Gly Met Glu Leu Val Gly 

560 565 570 

Cys Leu Phe Ser Arg Asn Trp Asn Val Arg Glu Met Ala Leu Arg 

575 580 585 

Arg Leu Ser His Asp Val Ser Gly Ala Leu Leu Leu Ala Asn Gly 

590 595 600 

Glu Ser Thr Gly Asn Ser Gly Gly Ser Ser Gly Ser Ser Pro Ser 

605 610 615 

Gly Gly Ala Thr Ser Gly Ser Ser Gin Thr Ser He Ser Gly Asp 

620 625 630 

Val Val Glu Ala Cys Cys Ser Val Leu Ser Met Val Cys Ala Asp 

635 640 645 

Pro Val Tyr Lys Val Tyr Val Ala Ala Leu Lys Thr Leu Arg Ala 

650 655 660 

Met Leu Val Tyr Thr Pro Cys His Ser Leu Ala Glu Arg He Lys 

665 670 675 

Leu Gin Arg Leu Leu Gin Pro Val Val Asp Thr He Leu Val Lys 

680 685 690 

cys Ala Asp Ala Asn Ser Arg Thr Ser Gin Leu Ser He Ser Thr 

695 700 705 

Leu Leu Glu Leu Cys Lys Gly Gin Ala Gly Glu Leu Ala Val Gly 
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710 






715 


720 


Arg 


Glu 


lie 


Leu 


Lys 


Ala Gly Ser 


lie 


Gly He 


Gly Gly Val Asp 










725 






730 


735 


Tyr 


Val 


Leu 


Asn 


Cys 


lie Leu Gly 


Asn 


Gin Thr 


Glu Ser Asn Asn 










740 






745 


750 


Trp 


Gin 


Glu 


Leu 


Leu 


Gly Arg Leu 


Cys 


Leu He 


Asp Arg Leu Leu 










755 






760 


765 


Leu 


Glu 


Phe 


Pro 


Ala 


Glu Phe Tyr 


Pro 


His Xle 


Val Ser Thr Asp 










770 






775 


780 


Val 


Ser 


Gin 


Ala 


Glu 


Pro Val Glu 


lie 


Arg Tyr 


Lys Lys Leu Leu 










785 






790 


795 


Ser 


Leu 


Leu 


Thr 


Phe 


Ala Leu Gin 


Ser 


He Asn 


Asn Ser His Ser 










800 






805 


810 


Met 


Val 


Gly 


Lys 


Leu 


Ser Arg Arg 


He 


Tyr Leu 


Ser Ser Ala Arg 










815 






820 


825 


Met 


Val 


Thr 


Thr 


Val 


Pro His Val 


Phe 


Ser Lys 


Leu Leu Glu Met 










830 




• 


835 


840 


Leu 


Ser 


Val 


Ser 


Ser 


Ser Thr His 


Phe 


Thr Arg 


Met Arg Arg Arg 










845 






850 


855 


Leu 


Met 


Ala 


lie 


Thr 


Asp Glu Val 


Glu 


Xle Ala 


Glu Ala Xle Gin 










860 






865 


870 


Leu 


Gly 


Val 


Glu 


Asp 


Thr Leu Asp 


Gly 


Gin Gin 


Asp Ser Phe Leu 










875 






880 


885 


61d 


Ala 


Ser 


Val 


Pro 


Asn Asn Tyr 


Leu 


Glu Thr 


Thr Glu Asn Ser 










890 






895 


900 


Ser 


Pro 


Glu 


Cys 


Thr 


lie His Leu 


Glu 


Lys Thr 


Gly Lys Gly Leu 










905 






910 


915 


Cys 


Ala 


Thr 


Lys 


Leu 


Ser Ala Ser 


Ser 


Glu Asp 


He Ser Glu Arg 










920 






925 


930 


Leu 


Ala 


Ser 


lie 


Ser 


Val Gly Pro 


Ser 


Ser Ser 


Thr Thr Thr Thr 










935 






940 


945 


Thr 


Thr 


Thr 


Glu 


Gin 


Pro Lys Pro 


Met 


Val Gin 


Thr Lys Gly Arg 










950 






955 


960 


Pro 


His 


Ser 


Gin 


Cys 


Leu Asn Ser 


Ser 


Pro Leu 


Ser His His Ser 










965 






970 


975 


Gin 


Leu 


Met 


Phe 


Pro 


Ala Leu Ser 


Thr 


Pro Ser 


Ser Ser Thr Pro 










980 






985 


990 


Ser 


Val 


Pro 


Ala 


Gly 


Thr Ala Thr 


Asp 


Val Ser 


Lys His Arg Leu 










995 




1000 


1005 


Gin 


Gly 


Phe 


lie 


Pro 


Cys Arg lie 


Pro 


Ser Ala 


Ser Pro Gin Thr 








1010 




1015 


1020 


Gin 


Arg 


Lys 


Phe 


Ser 


Leu Gin Phe 


His 


Arg Asn 


Cys Pro Glu Asn 








1025 




1030 


1035 


Lye 


Asp 


Ser 


Asp Lys 


Leu Ser Pro 


Val 


Phe Thr 


Gin Ser Arg Pro 








1040 




1045 


1050 


Leu 


Pro 


Ser 


Ser 


Asn 


Xle His Arg 


Pro Lys Pro 


Ser Arg Pro Thr 








1055 




1060 


1065 


Pro 


Gly 


Asn 


Thr 


Ser 


Lys Gin Gly 


Asp 


Pro Ser 


Lys Asn Ser Met 








1070 




1075 


1080 


Thr 


Leu 


Asp 


lieu 


Asn 


Ser Ser Ser 


Lys 


Cys Asp 


Asp Ser Phe Gly 








1085 




1090 


1095 


Cys 


Ser 


Ser 


Asn 


Ser 


Ser Asn Ala 


Val 


He Pro 


Ser Asp Glu Thr 








1100 




1105 


1110 


Val 


Phe 


Thr 


Pro 


Val 


Glu Glu Lys 


cys Arg Leu 


Asp Val Asn Thr 








1115 




1120 


1125 


Glu 


Leu 


Asn 


Ser 


Ser 


Xle Glu Asp 


Leu 


Leu Glu 


Ala Ser Met Pro 








1130 




1135 


1140 


Ser 


Ser 


Asp 


Thr Thr 


Val Thr Phe 


Lys 


Ser Glu 


Val Ala Val Leu 








1145 




1150 


1155 


Ser 


Pro 


Glu 


Lys Ala 


Glu Asn Asp 


Asp Thr Tyr 


Lys Asp Asp Val 








1160 




1165 


1170 


Asn 


His 


Asn 


Gin Lys 


cys Lys Glu 


Lys 


Met Glu 


Ala Glu Glu Glu 








1175 




1180 


1185 
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Glu Ala Leu Ala lie Ala Met Ala Met Ser Ala Ser Gin Asp Ala 
1190 1195 1200 

Leu Pro lie Val Pro Gin Leu Gin Val Glu Asn Gly Glu Asp lie 
1205 1210 1215 

lie lie lie Gin Gin Asp Thr Pro Glu Thr Leu Pro Gly His Thr 
1220 1225 1230 

Lys Ala Lys Gin Pro Tyr Arg Glu Asp Thr Glu Trp Leu Lys Gly 
1235 1240 1245 

Gin Gin lie Gly Leu Gly Ala Phe Ser Ser Cys Tyr Gin Ala Gin 
1250 1255 1260 

Asp Val Gly Thr Gly Thr Leu Met Ala Val Lys Gin Val Thr Tyr 
1265 1270 1275 

Val Arg Asn Thr Ser Ser Glu Gin Glu Glu Val Val Glu Ala Leu 
1280 1285 1290 

Arg Glu Glu He Arg Met Met Ser His Leu Asn His Pro Hsn He 
1295 1300 1305 

He Arg Met Leu Gly Ala Thr Cys Glu Lys Ser Asn Tyr Asn Leu 
1310 1315 1320 

Phe He Glu Trp Met Ala Gly Gly Ser Val Ala His Leu Leu Ser 
1325 1330 1335 

Lys Tyr Gly Ala Phe Lys Glu Ser Val Val He Asn Tyr Thr Glu 
1340 1345 1350 

Gin Leu Leu Arg Gly Leu Ser Tyr Leu His Glu Asn Gin He He 
1355 1360 1365 

His Arg Asp Val Lys Gly Ala Asn Leu Leu He Asp Ser Thr Gly 
1370 1375 1380 

Gin Arg Leu Arg He Ala Asp Phe Gly Ala Ala Ala Arg Leu Ala 
1385 1390 1395 

Ser Lys Gly Thr Gly Ala Gly Glu Phe Gin Gly Gin Leu Leu Gly 
1400 1405 1410 

Thr He Ala Phe Met Ala Pro Glu Val Leu Arg Gly Gin Gin Tyr 
1415 1420 1425 

Gly Arg Ser Cys Asp Val Ta^P Ser Val Gly Cys Ala He He Glu 
1430 1435 1440 

Met Ala Cys Ala Lys Pro Pro Trp Asn Ala Glu Lys His Ser Asn 
1445 1450 1455 

His Leu Ala Leu He Phe Lys He Ala Ser Ala Thr Thr Ala Pro 
1460 1465 1470 

Ser He Pro Ser His Leu Ser Pro Gly Leu Arg Asp Val Ala Leu 
1475 1480 1485 

Arg Cys Leu Glu Leu Gin Pro Gin Asp Arg Pro Pro Ser Arg Glu 
1490 1495 1500 

Leu Leu Lys His Pro Val Phe Arg Thr Thr Trp 



<210> 7 

<211> 830 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7482905CD1 

<400> 7 

Met Lys Ala Glu Gin Met Lys Arg Gin Glu Lys Glu Arg Leu Glu 
15 10 15 

Arg He Asn Arg Ala Arg Glu Gin Gly Trp Arg Asn Val Leu Ser 

20 25 30 

Ala Gly Gly Ser Gly Glu Val Lys Ala Pro Phe Leu Gly Ser Gly 
35 40 45 

Gly Thr He Ala Pro Ser Ser Phe Ser Ser Arg Gly Gin Tyr Glu 



1505 



1510 



50 



55 



60 
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His Tyr His Ala lie Phe Asp Gin Met Gin Gin Gin Arg Ala Glu 
65 70 75 

Asp Asn Glu Ala Lys Trp Lys Arg Glu lie Tyr Gly Arg Gly Leu 
80 85 90 

Pro Glu Arg Gin Lys Gly Gin Leu Ala Val Glu Arg Ala Lys Gin 
95 100 105 

Val Glu Glu Phe Leu Gin Arg Lys Arg Glu Ala Met Gin Asn Lys 

110 US 120 

Ala Arg Ala Glu Gly His Met Val Tyx Leu Ala Arg Leu Arg Gin 

125 130 135 

lie Arg Leu Gin Asn Phe Asn Glu Arg Gin Gin lie Lys Ala Lys 

140 145 150 

Leu Arg Gly Glu Lys Lys Glu Ala Asn His Ser Glu Gly Gin Glu 

155 160 165 

Gly Ser Glu Glu Ala Asp Mfet Arg Arg Lys Lys lie Glu Ser Leu 

170 175 180 

Lys Ala His Ala Asn Ala Arg Ala Ala Val Leu Lys Glu Gin Leu 

185 190 195 

Glu Arg Lys Arg Lys Glu Ala Tyr Glu Arg Glu Lys Lys Val Trp 

200 205 210 

Glu Glu His Leu Val Ala Lys Gly Val Lys Ser Ser Asp Val Ser 

215 220 225 

Pro Pro Leu Gly Gin His Glu Thr Gly Gly Ser Pro Ser Lys Gin 

230 235 240 

Gin Met Arg Ser Val He Ser Val Thr Ser Ala Leu Lys Glu Val 

245 250 255 

Gly Val Asp Ser Ser Leu Thr Asp Thr Arg Glu Thr Ser Glu Glu 

260 265 270 

Met Gin Lys Thr Asn Asn Ala He Ser Ser Lys Arg Glu He Leu 

275 280 285 

Arg Arg Leu Asn Glu Asn Leu Lys Ala Gin Glu Asp Glu Lys Gly 

290 295 300 

Lys Gin Asn Leu Ser Asp Thr Phe Glu He Asn Val His Glu Asp 

305 310 315 

Ala Lys Glu His Glu Lys Glu Lys Ser Val Ser Ser Asp Arg Lys 

320 325 330 

Lys Trp Glu Ala Gly Gly Gin Leu Val He Pro Leu Asp Glu Leu 

335 340 345 

Thr Leu Asp Thr Ser Phe Ser Thr Thr Glu Arg His Thr Val Gly 

350 355 360 

Glu Val He Lys Leu Gly Pro Asn Gly Ser Pro Arg Arg Ala Trp 

365 370 375 

Gly Lys Ser Pro Thr Asp Ser Val Leu Lys He Leu Gly Glu Ala 

380 385 390 

Glu Leu Gin Leu Gin Thr Glu Leu Leu Glu Asn Thr Thr He Arg 

395 400 405 

Ser Glu He Ser Pro Glu Gly Glu Lys Tyr Lys Pro Leu He Thr 

410 415 420 

Gly Glu Lys Lys Val Gin Cys He Ser His Glu He Asn Pro Ser 

425 430 435 

Ala He Val Asp Ser Pro Val Glu Thr Lys Ser Pro Glu Phe Ser 

440 445 450 

Glu Ala Ser Pro Gin Met Ser Leu Lys Leu Glu Gly Asn Leu Glu 

455 460 465 

Glu Pro Asp Asp Leu Glu Thr Glu He Leu Gin Glu Pro Ser Gly 

470 475 480 

Thr Asn Lys Asp Glu Ser Leu Pro Cys Thr He Thr Asp Val Trp 

485 490 495 

He Ser Glu Glu Lys Glu Thr Lys Glu Thr Gin Ser Ala Asp Arg 

500 505 510 

He Thr He Gin Glu Asn Glu Val Ser Glu Asp Gly Val Ser Ser 

515 520 525 

Thr Val Asp Gin Leu Ser Asp He His He Glu Pro Gly Thr Asn 
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535 


540 


Asp 




m s 


oer uys 


uys Asp Val Asp Lys Ser Val Gin 


Pro 










550 


555 


v>Xu 


Pro 


fxie fxie 


His Lys 


Val val His Ser Glu His Leu Asn 


Leu 








3DU 


c ^ tr 

565 


570 


veil 


Piro 


VaXn VaX 


Gin Ser 


Val Gin Cys Ser Pro Glu Glu Ser 


Phe 








3 73 


580 


585 


Ala 


Fxie 


Arg Ser 


His Ser 


His Leu Pro Pro Lys Asn Lys Asn 


Lys 








c n f\ 

590 


595 


600 


Asn 




Leu Leu 


lie Gly 


Leu Ser Thr Gly Leu Phe Asp Ala 


Asn 








505 


610 


615 


Asn 


Pro 


Lys Met 


Leu Arg 


Thr Cys Ser Leu Pro Asp Leu Ser 


Lys 








620 


625 


630 


Leu 


Pne 


Arg Thr 


Leu Met 


Asp Val Pro Thr Val Gly Asp Val 


Arg 








635 


640 


645 


61n 


Asp 


Asn Leu 


Glu He 


Asp Glu He Glu Asp Glu Asn He 


Lys 








650 


655 


660 


Glu 


Gly 


Pro Ser 


Asp Ser 


Glu Asp He Val Phe Glu Glu Thr 


Asp 








665 


670 


675 


Thr 


Asp 


Leu Gin 


Glu Leu 


Gin Ala Ser Met Glu Gin Leu Leu 


Arg 








680 


685 


690 


Glu 


Gin 


Pro Gly 


Glu Glu 


Tyr Ser Glu Glu Glu Glu Ser Val 


Leu 








695 


700 


705 


Lys 


Asn 


Ser Asp 


Val Glu 


Pro Thr Ala Asn Gly Thr Asp Val 


Ala 








710 


715 


720 


Asp 


Glu 


Asp Asp 


Asn Pro 


Ser Ser Glu Ser Ala Leu Asn Glu 


Glu 








725 


730 


735 


Trp 


His 


Ser Asp 


Asn Ser 


Asp Gly Glu He Ala Ser Glu Cys 


Glu 








740 


745 


750 


Cys 


Asp 


Ser Val 


Phe Asn 


His Leu Glu Glu Leu Arg Leu His 


Leu 








755 


760 


765 


Glu 


Gin 


Glu Met 


Gly Phe 


Glu Lys Phe Phe Glu Val Tyr Glu 


Lys 








770 


775 


780 


lie 


Lys 


Ala lie 


His Glu 


Asp Glu Asp Glu Asn He Glu He 


Cys 








785 


790 


795 


Ser 


Lys 


He Val 


Gin Asn 


He Leu Gly Asn Glu His Gin His 


Leu 








800 


805 


810 


Tyr 


Ala 


Lys He 


Leu His 


Leu Val Met Ala Asp Gly Ala Tyr 


Gin 








815 


820 


825 


Glu 


Asp 


Asn Asp 


Glu 












830 







<210> 8 
<211> 455 
<212> PRT 

<213> Homo sapiens 



<22Q> 

<221> inisc_feature 

<223> Incyte ID N6: 7483019CD1 



<400> 8 

Met Arg He Val Cys Leu Val Lys Asn Gin Gin Pro Leu Gly Ala 
15 10 15 

Thr He Lys Arg His Glu Met Thr Gly Asp He Leu Val Ala Arg 

20 25 30 

He He His Gly Gly Leu T^a Glu Arg Ser Gly Leu Leu Tyr Ala 

35 40 45 

Gly Asp Lys Leu Val Glu Val Asn Gly Val Ser Val Glu Gly Leu 

50 55 60 

Asp Pro Glu Gin Val He His He Leu Ala Met Ser Arg Gly Thr 

65 70 75 

He Met Phe Lys Val Val Pro Val Ser Asp Pro Pro Val Asn Ser 
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80 


Gin 


Gin 


Het 


Val 


Tyr 
95 


Glu 


Asp 


Pro 


Asp 


He 
110 


Lys 


Gly 


Asp 


lie 


Leu 
125 


Tit) 


Gin 


Ala 


Arg 


Lys 
140 


Val 


Pro 


Ser 


Asn 


His 
155 


Trp 


Ser 


Gin 


Pro 


Tyr 
170 


Tyr 


Lys 


Glu 


Glu 


Phe 
185 


Gly 


Phe 


Arg 


Arg 


Ser 
200 


Ser 


Pro 


Leu 


His 


Ala 
215 


Ala 


Val 


Gly 


Ala 


Pro 
230 


Pro 


Ser 


Asp 


Lys 


Tyr 
245 


Val 


Gly 


Val 


Asn 


Glu 
260 


Ser 


His 


Phe 


Gin 


Ser 
275 


Ser 


Tyr 


Glu 


Thr 


Asn 
290 


Thr 


Phe 


Glu 


Asn 


Leu 
305 


Glu 


Tyr 


Lys 


Gly 


His 
320 


Thr 


Val 


Leu 


Val 


Glu 
335 


Gin 


Asp 


lie 


Gin 


Gly 
350 


lie 


Phe 


lie 


Lys 


Pro 
365 


Lys 


Asn 


Ala 


Lys 


Val 
380 


Lys 


Asp 


Glu 


Asp 


Leu 
395 


Glu 


Thr 


Gin 


Phe 


Gly 
410 


Ser 


Leu 


His 


Asp 


Ala 
425 


Ala 


Gin 


Glu 


Glu 


Pro 
440 


Asp 


Thr 


Glu 


Ser 


Gin 
455 



85 90 
Val Arg Ala Met Thr Glu Tyr Trp Pro Gin 

100 105 
Pro Cys Met Asp Ala Gly Leu Pro Phe Gin 

115 120 
Gin He Val Asp Gin Asn Asp Ala Leu Trp 

130 135 
He Ser Asp Pro Ma Thr Cys Ala Gly Leu 

145 150 
Leu Leu Lys Arg Lys Gin Arg Glu Phe Trp 

160 165 
Gin Pro His Thr Cys Leu Lys Ser Thr Leu 

175 180 
Val Gly Tyr Gly Gin Lys Phe Phe He Ala 

190 195 
Met Arg Leu Cys Arg Arg Lys Ser His Leu 

205 210' 
Ser Val Cys Cys Thr Gly Ser Cys Tyr Ser 

220 225 
Tyr Glu Glu Val Val Arg Tyr Gin Arg Arg 

235 240 
Arg Leu He Val Leu Met Gly Pro Ser Gly 

250 255 
Leu Arg Arg Gin Leu He Glu Phe Asn Pro 

265 270 
Ala Val Pro His Thr Thr Arg Thr Lys Lys 

280 285 
Gly Arg Glu TVr His ^ryr Val Ser Lys Glu 

295 300 
He Tyr Ser His Arg Met Leu Glu Tyr Gly 

310 315 
Leu Tyr Gly Thr Ser Val Gly Ala Val Gin 

325 330 
Gly Lys He Cys Val Met Asp Leu Glu Pro 

340 345 
Val Arg Thr His Glu Leu Lys Pro Tyr Val 

355 360 
Ser Asn Met Arg Cys Met Lys Gin Ser Arg 

370 375 
He Thr Asp Tyr Tyr Val Asp Met Lys Phe 

385 390 
Gin Glu Met Glu Asn Leu Ala Gin Arg Met 

400 405 
Gin Phe Phe Asp His Val He Val. Asn Asp 

415 420 
cys Ala Gin Leu Leu Ser Ala He Gin Lys 

430 435 
Gin Trp Val Pro Ala Thr Trp He Ser Ser 

445 450 



<210> 9 
<211> 1720 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> iiiisc_feature 

<223> Incyte ID No: 5455490CD1 

<400> 9 

Met Met Lys Arg Arg Arg Glu Arg Leu Gly Ala Pro* Cys Leu Arg 
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1 



5 



10 



15 



lie Gin lie Ser Thr Leu Cys Arg Gly Ala Glu Val Asn Gin His 
20 25 30 

Met Phe Ser Pro Thr Ser Ala Pro Ala Leu Phe Leu Thr Lys Val 
35 40 45 

Pro Phe Ser Ala Asp Cys Ala Leu Ala Thr Ser Pro Leu Ala He 
50 55 60 

Phe Leu Asn Pro Arg Ala His Ser Ser Pro Gly Thr Pro Cys Ser 
65 70 75 

Ser Arg Pro Leu Pro Trp Ser Cys Arg Thr Ser Asn Arg Lys Ser 
80 85 90 

Leu He Val Thr Ser Ser Thr Ser Pro Thr Leu Pro Arg Pro His 
95 100 105 

Ser Pro Leu His Gly His Thr Gly Asn Ser Pro Leu Asp Ser Pro 

110 115 120 

Arg Asn Phe Ser Pro Asn Ala Pro Ala His Phe Ser Phe Val Pro 

125 130 135 

Ala Arg Ser His Ser His Arg Ala Asp Arg Thr Asp Gly Arg Arg 

140 145 150 

Trp Ser Leu Ala Ser Leu Pro Ser Ser Gly Tyr Gly Thr Asn Thr 

155 160 165 

Pro Ser Ser Thr Val Ser Ser Ser Cys Ser Ser Gin Glu Lys Leu 

170 175 180 

His Gin Leu Pro Phe Gin Pro Thr Ala Asp Glu Leu His Phe Leu 

185 190 195 

Thr Lys His Phe Ser Thr Glu Ser Val Pro Asp Glu Glu Gly Arg 

200 205 210 

Gin Ser Pro Ala Met Arg Pro Arg Ser Arg Ser Leu Ser Pro Gly 

215 220 225 

Arg Ser Pro Val Ser Phe Asp Ser Glu He He Met Met Asn His 

230 235 240 

Val Tyr Lys Glu Arg Phe Pro Lys Ala Thr Ala Gin Met Glu Glu 

245 250 255 

Arg Leu Ala Glu Phe He Ser Ser Asn Thr Pro Asp Ser Val Leu 

260 265 270 

Pro Leu Ala Asp Gly Ala Leu Ser Phe He His His Gin Val He 

275 280 285 

Glu Met Ala Arg Asp Cys Leu Asp Lys Ser Arg Ser Gly Leu He 

290 295 300 

Thr Ser Gin Tyr Phe Tyr Glu Leu Gin Glu Asn Leu Glu Lys Leu 

305 310 315 

Leu Gin Asp Ala His Glu Arg Ser Glu Ser Ser Glu Val Ala Phe 

320 325 330 

Val Met Gin Leu Val Lys Lys Leu Met He He He Ala Arg Pro 

335 340 345 

Ala Arg Leu Leu Glu Cys Leu Glu Phe Asp Pro Glu Glu Phe Tyr 

350 355 360 

His Leu Leu Glu Ala Ala Glu Gly His Ala Lys Glu Gly Gin Gly 

365 370 375 

He Lys Cys Asp He Pro Arg Tyr He Val Ser Gin Leu Gly Leu 

380 385 390 

Thr Arg Asp Pro Leu Glu Glu Met Ala Gin Leu Ser Ser Cys Asp 

395 400 405 

Ser Pro Asp Thr Pro Glu Thr Asp Asp Ser He Glu Gly His Gly 

410 415 420 

Ala Ser Leu Pro Ser Lys Lys Thr Pro Ser Glu Glu Asp Phe Glu 

425 430 435 

Thr He Lys Leu He Ser Asn Gly Ala Tyr Gly Ala Val Phe Leu 

440 445 450 

Val Arg His Lys Ser Thr Arg Gin Arg Phe Ala Met Lys Lys He 

455 460 465 

Asn Lys Gin Asn Leu He Leu Arg Asn Gin He Gin Gin Ala Phe 



470 



475 



480 
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Val 


Glu 


Arg 


Asp 


He 
485 


Leu 


Thr 


Phe 


Ala 


Glu 
490 


Asn 


Pro 


Phe 


Val Val 
495 


Ser 


Met 


Phe 


Cys 


Ser 
500 


Phe 


Asp 


Thr 


Lys 


Arg 
505 


His 


Leu 


Cys 


Met Val 
510 


Met 


-Glu 


Tyr 


Val 


Glu 
515 


Gly 


Gly 


Asp 


Cys 


Ala 
520 


Thr 


Leu 


Leu 


Lys Asn 
525 


He 


Gly 


Ala 


Leu 


Pro 
530 


Val 


Asp 


Met 


Val 


Arg 
535 


Leu 


Tyr 


Phe 


Ala Glu 
540 


Thr 


Val 


Leu 


Ala 


Leu 
545 


Glu 


Tyr 


Leu 


His 


Asn 
550 


Tyr 


Gly 


He 


Val His 
555 


Arg 


Asp 


Leu 


Lys 


Pro 
560 


Asp 


Asn 


Leu 


Leu 


He 
565 


Thr 


Ser 


Met 


Gly His 
570 


He 


Lys 


Leu 


Thr 


Asp 
575 


Phe 


Gly 


Leu 


Ser 


Lys 
580 


He 


Gly 


Leu 


Met Ser 
585 


Leu 


Thr 


Thr 


Asn 


Leu 
590 


Tyr 


Glu 


Gly 


His 


He 
595 


Glu 


Lys 


Asp 


Ala Arg 
600 


Glu 


Phe 


Leu 


Asp 


Lys 
605 


Gin 


Val 


Cys 


Gly 


Thr 
610 


Pro 


Glu 


Tyr 


He Ala 
615 


Pro 


dlu 


Val 


He 


Leu 
620 


Arg 


Gin 


Gly 


Tyr 


Gly 
625 


Lys 


Pro 


Val 


Asp Trp 
630 


Trp 


Ala 


Met 


Gly 


He 
635 


He 


Leu 


Tyr 


Glu 


Phe 
640 


Leu 


Val 


Gly 


Cys Val 
645 


Pro 


Phe 


Phe 


Gly 


Asp 
650 


Thr 


Pro 


Glu 


Glu 


Leu 
655 


Phe 


Gly 


Gin 


Val He 
660 


Ser 


Asp 


Glu 


He 


Val 
665 


Trp 


Pro 


Glu 


Gly 


Asp 
670 


Glu 


Ala 


Leu 


Pro Pro 
675 


Asp 


Ala 


Gin 


Asp 


Leu 
680 


Thr 


Ser 


Lys 


Leu 


Leu 
685 


His 


Gin 


Asn 


Pro Leu 
690 


Glu 


Arg 


Leu 


Gly 


Thr 
695 


Gly 


Ser 


Ala 


Tyr 


Glu 
700 


Val 


Lys 


Gin 


His Pro 
705 


Phe 


Phe 


Thr 


Gly 


Leu 
710 


Asp 


Trp 


Thr 


Gly 


Leu 
715 


Leu 


Arg 


Gin 


Lys Ala 
720 


Glu 


Phe 


He 


Pro 


Gin 
725 


Leu 


Glu 


Ser 


Glu 


Asp 
730 


Asp 


Thr 


Ser 


Tyr Phe 
735 


Asp 


Thr 


Arg 


Ser 


Glu 
740 


Arg 


Tyr 


His 


His 


Met 
745 


Asp 


Ser 


Glu 


Asp Glu 
750 


Glu 


Glu 


Val 


Ser 


Glu 
755 


Asp 


Gly 


Cys 


Leu 


Glu 
760 


He 


Arg 


Gin 


Phe Ser 
765 


Ser 


Cys 


Ser 


Pro 


Arg 
770 


Phe 


Asn 


Lys 


Val 


Tyr 
775 


Ser 


Ser 


Met 


Glu Arg 
780 


lieu 


Ser 


Leu 


Leu 


Glu 
785 


Glu 


Arg 


Arg 


Thr 


Pro 
790 


Pro 


Pro 


Thr 


Lys Arg 
795 


Ser 


Leu 


Ser 


Glu 


Glu 
800 


Lys 


Glu 


Asp 


His 


Ser 
805 


Asp 


Gly 


Leu 


Ala Gly 
810 


lieu 


Lys 


Gly 


Arg 


Asp 
815 


Arg 


Ser 


Trp 


Val 


He 
820 


Gly 


Ser 


Pro 


Glu He 
825 


Leu 


Arg 


Lys 


Arg 


Leu 
830 


Ser 


Val 


Ser 


Glu 


Ser 
835 


Ser 


His 


Thr 


Glu Ser 
840 


Asp 


Ser 


Ser 


Pro 


Pro 
845 


Met 


Thr 


Val 


Arg 


Arg 
850 


Arg 


Cys 


Ser 


Gly Leu 
855 


Leu 


Asp 


Ala 


Pro 


Arg 
860 


Phe 


Pro 


Glu 


Gly 


Pro 
865 


Glu 


Glu 


Ala 


Ser Ser 
870 


Thr 


Leu 


Arg 


Arg 


Gin 
875 


Pro 


Gin 


Glu 


Gly 


He 
880 


Trp 


Val 


Leu 


Thr Pro 
885 


Pro 


Ser 


Gly 


Glu 


Gly 
890 


Val 


Ser 


Gly 


Pro 


Val 
895 


Thr 


Glu 


His 


Ser Gly 
900 


(slU 


Gxn 


Arg 


Pro 


Lys 


Leu 


Asp 


Glu 


Glu 


Ala 


Val 


Gly 


Arg 


Ser Ser 










•905 










910 






915 


Gly 


Ser 


Ser 


Pro Ala 


Met 


Glu 


Thr 


Arg 


Gly 


Arg 


Gly 


Thr 


Ser Gin 










920 










925 








930 


Leu 


Ala 


Glu 


Gly Ala 


Thr 


Ala 


Lys 


Ala 


He 


Ser 


Asp 


Leu 


Ala Val 










935 










940 








945 


Arg 


Arg 


Ala 


Arg His 


Arg 


Leu 


Leu 


Ser 


Gly 


Asp 


Ser 


Thr 


Glu Lys 
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950 



955 



960 



Arg Thr Ala Arg Pro Val Asn Lys Val He Lys Ser Ala Ser Ala 
965 970 975 

Thr Ala Leu Ser Leu Leu He Pro Ser Glu His His Thr Cys Ser 
980 985 990 

Pro Leu Ala Ser Pro Met Ser Pro His Ser Gin Ser Ser Asn Pro 
995 1000 1005 

Ser Ser Arg Asp Ser Ser Pro Ser Arg Asp Phe Leu Pro Ala Leu 
1010 1015 1020 

Gly Ser Met Arg Pro Pro He He He His Arg Ala Gly Lys Lys 
1025 1030 1035 

Tyr Gly Phe Thr Leu Arg Ala He Arg Val Tyr Met Gly Asp Ser 
1040 1045 1050 

Asp Val Tyr Thr Val His His Met Val T3:p His Val Glu Asp Gly 
1055 1060 1065 

Gly Pro Ala Ser Glu Ala Gly Leu Arg Gin Gly Asp Leu He Thr 
1070 1075 1080 

His Val Asn Gly Glu Pro Val His Gly Leu Val His Thr Glu Val 
1085 1090 1095 

Val Glu Leu He Leu Lys Ser Gly Asn Lys Val Ala He Ser Thr 
1100 1105 1110 

Thx Pro Leu Glu Asn Thr Ser He Lys Val Gly Pro Ala Arg Lys 
1115 1120 1125 

Gly Ser Tyr Lys Ala Lys Met Ala Arg Arg Ser Lys Arg Ser Arg 
1130 1135 1140 

Gly Lys Asp Gly Gin Glu Ser Arg Lys Arg Ser Ser Leu Phe Arg 
1145 1150 1155 

Lys He Thr Lys Gin Ala Ser Leu Leu His Thr Ser Arg Ser Leu 
1160 1165 1170 

Ser Ser Leu Asn Arg Ser Leu Ser Ser Gly Glu Ser Gly Pro Gly 
1175 1180 1185 

Ser Pro Thr His Ser His Ser Leu Ser Pro Arg Ser Pro Thr Gin 
1190 1195 1200 

Gly Tyr Arg Val Thr Pro Asp Ala Val His Ser Val Gly Gly Asn 
1205 1210 1215 

Ser Ser Gin Ser Ser Ser Pro Ser Ser Ser Val Pro Ser Ser Pro 
1220 1225 1230 

Ala Gly Ser Gly His Thr Arg Pro Ser Ser Leu His Gly Leu Ala 
1235 1240 1245 

Pro Lys Leu Gin Arg Gin Tyr Arg Ser Pro Arg Arg Lys Ser Ala 
1250 1255 1260 

Gly Ser He Pro Leu Ser Pro Leu Ala His Thr Pro Ser Pro Pro 
1265 1270 1275 

Pro Pro Thr Ala Ser Pro Gin Arg Ser Pro Ser Pro Leu Ser Gly 
1280 1285 1290 

His Val Ala Gin Ala Phe Pro Thr Lys Leu His Leu Ser Pro Pro 
1295 1300 1305 

Leu Gly Arg Gin Leu Ser Arg Pro Lys Ser Ala Glu Pro Pro Arg 
1310 1315 1320 

Ser Pro Leu Leu Lys Arg Val Gin Ser Ala Glu Lys Leu Ala Ala 
1325 1330 1335 

Ala Leu Ala Ala Ser Glu Lys Lys Leu Ala Thr Ser Arg Lys His 
1340 1345 1350 

Ser Leu Asp Leu Pro His Ser Glu Leu Lys Lys Glu Leu Pro Pro 
1355 1360 1365 

Arg Glu Val Ser Pro Leu Glu Val Val Gly Ala Arg Ser Val Leu 
1370 1375 1380 

Ser Gly Lys Gly Ala Leu Pro Gly Lys Gly Val Leu Gin Pro Ala 
1385 1390 1395 

Pro Ser T^g Ala Leu Gly Thr Leu Arg Gin Asp Arg Ala Glu Arg 
1400 1405 1410 

Arg Glu Ser Leu Gin Lys Gin Glu Ala He Arg Glu Val Asp Ser 



1415 



1420 



1425 
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Ser Glu Asp Asp Thr Glu Glu Gly Pro Glu Asn Ser Glu Gly Ala 

1430 1435 1440 

Gin Glu Leu Ser Leu Ala Pro His Pro Glu Val Ser Gin Ser Val 

1445 1450 1455 

Ala Pro Lys Gly Ala Gly Glu Ser Gly Glu Glu Asp Pro Phe Pro 

1460 1465 1470 

Ser Arg Asp Pro Arg Ser Leu Gly Pro Met Val Pro Ser Leu Leu 

1475 1480 1485 

Thr Gly He Thr Leu Gly Pro Pro Arg Met Glu Ser Pro Ser Gly 

1490 1495 1500 

Pro His Arg Arg Leu Gly Ser Pro Gin Ala He Glu Glu Ala Ala 

1505 1510 1515 

Ser Ser Ser Ser Ala Gly Pro Asn Leu Gly Gin Ser Gly Ala Thr 

1520 1525 1530 

Asp Pro He Pro Pro Glu Gly Cys Trp Lys Ala Gin His Leu His 

1535 1540 1545 

Thr Gin Ala Leu Thr Ala Leu Ser Pro Ser Thr Ser Gly Leu Thr 

1550 1555 1560 

Pro Thr Ser Ser Cys Ser Pro Pro Ser Ser Thr Ser Gly Lys Leu 

1565 1570 1575 

Ser Met Trp Ser Trp Lys Ser Leu He Glu Gly Pro Asp Arg Ala 

1580 1585 1590 

Ser Pro Ser Arg Lys Ala Thr Met Ala Gly Gly Leu Ala Asn Leu 

1595 1600 1605 

Gin Asp Leu Glu Asn Thr Thr Pro Ala Gin Pro Lys Asn Leu Ser 

1610 1615 1620 

Pro Arg Glu Gin Gly Lys Thr Gin Pro Pro Ser Ala Pro Arg Leu 

1625 1630 1635 

Ala His Pro Ser Tyr Glu Asp Pro Ser Gin Gly Trp Leu Trp Glu 

1640 1645 1650 

Ser Glu Cys Ala Gin Ala Val Lys Glu Asp Pro Ala Leu Ser He 

1655 1660 1665 

Thr Gin Val Pro Asp Ala Ser Gly Asp Arg Arg Gin Asp Val Pro 

1670 1675 1680 

cys Arg Gly Cys Pro Leu Thr Gin Lys Ser Glu Pro Ser Leu Arg 

1685 1690 1695 

Arg Gly Gin Glu Pro Gly Gly His Gin Lys His Arg Asp Leu Ala 

1700 1705 1710 

Leu Val Pro Asp Glu Leu Leu Lys Gin Thr 

1715 1720 

<210> 10 

<211> 449 

<212> PUT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 5547067CD1 

<400> 10 

Met Leu Met Gly Phe Cys Arg Leu Glu Glu Ala Gly Leu Val Ser 
15 10 15 

Arg Ser He Arg Glu Arg Asn Cys Leu Tyr Asn Trp Asp Ser Arg 

20 25 30 

Phe Ser Arg Glu Arg Arg Gin Arg Leu Gly Met Gly Ala Val Ser 

35 40 45 

Cys Arg Gin Gly Gin His Thr Gin Gin Gly Glu His Thr Arg Val 

50 55 60 

Ala Val Pro His Lys Gly Gly Asn He Arg Gly Pro Trp Ala Arg 

65 70 75 

Gly Trp Lys Ser Leu Trp Thr Gly Leu Gly Thr He Arg Ser Asp 

80 85 90 
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Deu Glu Glu Leu Trp Glu Leu Arg Gly His His Tyr Leu His Gin 
95 100 105 

Glu Ser Leu Lys Pro Ala Pro Val Leu Val Glu Lys Pro Leu Pro 

110 115 120 

Glu Trp Pro Val Pro Gin Phe He Asn Leu Phe Leu Pro Glu Phe 

125 130 135 

Pro He Arg Pro He Arg Gly Gin Gin Gin Leu Lys He Leu Gly 

140 145 150 

Leu Val Ala Lys Gly Ser Phe Gly Thr Val Leu Lys Val Leu Asp 

155 160 165 

Cys Thr Gin Lys Ala Val Phe Ala Val Lys Val Val Pro Lys Val 

170 175 180 

Lys Val Leu Gin Arg Asp Thr Val Arg Gin Cys Lys Glu Glu Val 

185 190 195 

Ser He Gin Arg Gin He Asn His Pro Phe Val His Ser Leu Gly 

200 205 210 

Asp Ser Trp Gin Gly Lys Arg His Leu Phe He Met Cys Ser Tyr 

215 220 225 

Cys Ser Thr Asp Leu Tyr Ser Leu Trp Ser Ala Val Gly Cys Phe 

230 235 240 

Pro Glu Ala Ser He Arg Leu Phe Ala Ala Glu Leu Val Leu Val 

245 250 255 

Leu Cys Tyr Leu His Asp Leu Gly He Met His Arg Asp Val Lys 

260 265 270 

Met Glu Asn He Leu Leu Asp Glu Arg Gly His Leu Lys Leu O^ir 

275 280 285 

Asp Phe Gly Leu Ser Arg His Val Pro Gin Gly Ala Gin Ala Tyr 

290 295 300 

Thr He Cys Gly Thr Leu Gin Tyr Met Ala Pro Glu Val Leu Ser 

305 310 315 

Gly Gly Pro Tyr Asn His Ala Ala Asp Trp Trp Ser Leu Gly Val 

320 325 330 

Leu Leu Phe Ser Leu Ala Thr Gly Lys Phe Pro Val Ala Ala Glu 

335 340 345 

Arg Asp His Val Ala Met Leu Ala Ser Val Thr His Ser Asp Ser 

350 355 360 

Glu He Pro Ala Ser Leu Asn Gin Gly Leu Ser Leu Leu Leu His 

365 370 375 

Glu Leu Leu Cys Gin Asn Pro Leu His Arg Leu Arg l^r Leu His 

380 385 390 

His Phe Gin Val His Pro Phe Phe Arg Gly Val Ala Phe Asp Pro 

395 400 405 

Glu Leu Leu Gin Lys Gin Pro Val Asn Phe Val Thr Glu Thr Gin 

410 415 420 

Ala Thr Gin Pro Ser Ser Ala Glu Thr Met Pro Phe Asp Asp Phe 

425 430 435 

Asp Cys Asp Leu Glu Ser Phe Leu Leu Tyr Pro He Pro Ala 

440 445 



<210> 11 

<211> 358 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 71675660CD1 

<400> 11 

Met Asp Asp Ala Thr Val Leu Arg Lys Lys Gly Tyr He Val Gly 

15 10 15 

He Asn Leu Gly Lys Gly Ser Tyr Ala Lys Val Lys Ser Ala Tyr 

20 25 30 
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Ser Glu Arg Leu Lys Phe Asn Val Ala Val hys He He Asp Arg 
35 40 45 

Lys Lys Thr Pro Thr Asp Phe Val Glu Arg Phe Leu Pro Arg Glu 
50 55 60 

Met Asp He Leu Ala Thr Val Asn His Gly Ser He He Lys Thr 
65 70 75 

Tyr Glu He Phe Glu Thr Ser Asp Gly Arg He Tyr He He Met 
80 85 90 

Glu Leu Gly Val Gin Gly Asp Leu Leu Glu Phe He Lys Cys Gin 
95 100 105 

Gly Ala Leu His Glu Asp Val Ala Arg Lys Met Phe Arg Gin Leu 
110 115 120 

Ser Ser Ala Val Lys Tyr Cys His Asp Leu Asp He Val His Arg 
.125 130 . 135 

Asp Leu Lys Cys Glu Asn Leu Leu Leu Asp Lys Asp Phe Asn He 
140 145 150 

Lys Leu Ser Asp Phe Gly Phe Ser Lys Arg Cys Leu Arg Asp Ser 
155 160 165 

Asn Gly Arg He He Leu Ser Lys Thr Phe Cys Gly Ser Ala Ala 
170 175 180 

Tyr Ala Ala Pro Glu Val Leu Gin Ser He Pro Tyr Gin Pro Lys 
185 190 195 

Val Tyr Asp He Trp Ser Leu Gly Val He Leu Tyr He Met Val 
200 205 210 

Cys Gly Ser Met Pro Tyr Asp Asp Ser Asp He Arg Lys Met Leu 
215 220 225 

TVrg He Gin Lys Glu His Arg Val Asp Phe Pro Arg Ser Lys Asn 
230 235 240 

Leu Thr Cys Glu Cys Lys Asp Leu He Tyr Arg Met Leu Gin Pro 
245 250 255 

Asp Val Ser Gin Arg Leu His He Asp Glu He Leu Ser His Ser 
260 265 270 

Trp Leu Gin Pro Pro Lys Pro Lys Ala Met Ser Ser Ala Ser Phe 
275 280 285 

Lys Arg Glu Gly Glu Gly Lys Tyr Arg Ala Glu Cys Lys Leu Asp 
290 295 300 

Thr Lys Thr Gly Leu Arg Pro Asp His Arg Pro Asp His Lys Leu 
305 310 315 

Gly Ala Lys Thr Gin His Arg Leu Leu Val Val Pro Glu Asn Glu 
320 325 330 

Asn Arg Met Glu Asp Arg Leu Ala Glu Thr Ser Arg Ala Lys Asp 
335 340 345 

His His He Ser Gly Ala Glu Val Gly Lys Ala Ser Thr 



<210> 12 

<211> 358 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> itiisc_feature 

<223> Incyte ID No: 71678683CD1 

<400> 12 

Met Asp Asp Ala Thr Val Leu Arg Lys Lys Gly Tyr He Val Gly 
15 10 15 

He Asn Leu Gly Lys Gly Ser Tyr Ala Lys Val Lys Ser Ala Tyr 
20 25 30 

Ser Glu Arg Leu Lys Phe Asn Val Ala Val Lys He He Asp Arg 
35 40 45 

Lys Lys Thr Pro Thr Asp Phe Val Glu Arg Phe Leu Pro Arg Glu 



350 



355 



50 



55 



60 
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Met Asp lie Leu Ala Thr Val Asn His Gly Ser He He Lys Thr 
65 70 75 

Tyr Glu He Phe Glu Thr Ser Asp Gly Arg He Tyr He He Met 
80 85 90 

Glu Leu Gly Val Gin Gly Asp Leu Leu Glu Phe He Lys Cys Gin 
95 100 105 

Gly Ala Leu His Glu Asp Val Ala Arg Lys Met Phe Arg Gin Leu 
HO 115 120 

Ser Ser Ala Val Lys Tyr Cys His Asp Leu Asp He Val His Arg 
125 130 135 

Asp Leu Lys Cys Glu Asn Leu Leu Leu Asp Lys Asp Phe Asn He 
140 145 150 

Lys Leu Ser Asp Phe Gly Phe Ser Lys Arg Cys Leu Arg Asp Ser 
155 160 165 

Asn Gly Arg He He Leu Ser Lys Thr Phe Cys Gly Ser Ala Ala 
170 175 180 

Tyr Ala Ala Pro Glu Val Leu Gin Ser He Pro Tyr Gin Pro Lys 
185 190 195 

Val Tyr Asp He Trp Ser Leu Gly Val He Leu Tyr He Met Val 
200 205 210 

Cys Gly Ser Met Pro Tyr Asp Asp Ser Asp He Arg Lys Met Leu 
215 220 225 

Arg He Gin Lys Glu His Arg Val Asp Phe Pro Arg Ser Lys Asn 
230 235 240 

Leu Thr Cys Glu Cys Lys Asp Leu He Tyr Arg Met Leu Gin Pro 
245 250 255 

Asp Val Ser Gin Arg Leu His He Asp Glu He Leu Ser His Ser 
260 265 270 

Trp Leu Gin Pro Pro Lys Pro Lys Ala Thr Ser Ser Ala Ser Phe 
275 280 285 

Lys Arg Glu Gly Glu Gly Lys Tyr Arg Ala Glu Cys Lys Leu Asp 
290 295 300 

Thr Lys Thr Gly Leu Arg Pro Asp His Arg Pro Asp His Lys Leu 
305 310 315 

Gly Ala Lys Thr Gin His Arg Leu Leu Val Val Pro Glu Asn Glu 
320 325 330 

Asn Arg Met Glu Asp Arg Leu Ala Glu Thr Ser Arg Ala Lys Asp 
335 340 345 

His His He Ser Gly Ala Glu Val Gly Lys Ala Ser Thr 



<210> 13 

<211> 929 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feat\are 

<223> Incyte ID No: 7474567CD1 

<400> 13 

Met Glu Ser Met Leu Asn Lys Leu Lys Ser Thr Val Thr Lys Val 
15 10 15 

Thr Ala Asp Val Thr Ser Ala Val Met Gly Asn Pro Val Thr Arg 
20 25 30 

Glu Phe* Asp Val Gly Arg His He Ala Ser Gly Gly Asn Gly Leu 
35 40 45 

Ala Trp Lys He Phe Asn Gly Thr Lys Lys Ser Thr Lys Gin Glu 
50 55 60 

Val Ala Val Phe Val Phe Asp Lys Lys Leu He Asp Lys Tyr Gin 
65 70 75 

Lys Phe Glu Lys Asp Gin He He Asp Ser Leu Lys Arg Gly Val 



350 



355 



80 



85 



90 
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Gin Gin I*eu Thr Arg Leu Arg His Pro Arg Leu Leu Thr Val Gin 
95 100 105 

His Pro Leu Glu Glu Ser Arg Asp Cys Leu Ala Phe Cys Thr Glu 

110 115 120 

Pro Val Phe Ala Ser Leu Ala Asn Val Leu Gly Asn Trp Glu Asn 

125 130 135 

Leu Pro Ser Pro lie Ser Pro Asp lie Lys Asp Tyr Lys Leu Tyr 

140 145 150 

Asp Val Glu Thr Lys Tyr Gly Leu Leu Gin Val Ser Glu Gly Leu 

155 160 165 

Ser Phe Leu His Ser Ser Val Lys Met Val His Gly Asn lie Thr 

170 175 180 

Pro Glu Asn lie lie Leu Asn Lys Ser Gly Ala Tzp Lys lie Met 

185 190 195 

Gly Phe Asp Phe Cys Val Ser Ser Thr Asn Pro Ser Glu Gin Glu 

200 205 210 

Pro Lys Phe Pro Cys Lys Glu Trp Asp Pro Asn Leu Pro Ser Leu 

215 220 225 

Cys Leu Pro Asn Pro Glu Tyr Leu Ala Pro Glu Tyr He Leu Ser 

230 235 240 

Val Ser Cys Glu Thr Ala Ser Asp Met Tyr Ser Leu Gly Thr Val 

245 250 255 

Met Tyr Ala Val Phe Asn Lys Gly Lys Pro He Phe Glu Val Asn 

260 265 270 

Lys Gin Asp He Tyr Lys Ser Phe Ser Arg Gin Leu Asp Gin Leu 

275 280 285 

Ser Arg Leu Gly Ser Ser Ser Leu Thr Asn He Pro Glu Glu Val 

290 295 300 

Arg Glu His Val Lys Leu Leu Leu Asn Val Thr Pro Thr Val Arg 

305 310 315 

Pro Asp Ala Asp Gin Met Thr Lys He Pro Phe Phe Asp Asp Val 

320 325 330 

Gly Ala Val Thr Leu Gin Tyr Phe Asp Thr Leu Phe Gin Arg Asp 

335 340 345 

Asn Leu Gin Lys Ser Gin Phe Phe Lys Gly Leu Pro Lys Val Leu 

350 355 360 

Pro Lys Leu Pro Lys Arg Val He Val Gin Arg He Leu Pro Cys 

365 370 375 

Leu Thr Ser Glu Phe Val Asn Pro Asp Met Val Pro Phe Val Leu 

380 385 390 

Pro Asn Val Leu Leu He Ala Glu Glu Cys Thr Lys Glu Glu Tyr 

395 400 405 

Val Lys Leu He Leu Pro Glu Leu Gly Pro Val Phe Lys Gin Gin 

410 415 420 

Glu Pro He Gin He Leu Leu He Phe Leu Gin Lys Met Asp Leu 

425 430 435 

Leu Leu Thr Lys Thr Pro Pro Asp Glu He Lys Asn Ser Val Leu 

440 445 450 

Pro Met Val Tyr Arg Ala Leu Glu Ala Pro Ser He Gin He Gin 

455 460 465 

Glu Leu Cys Leu Asn He He Pro Thr Phe Ala Asn Leu He Asp 

470 475 480 

Tyr Pro Ser Met Lys Asn Ala Leu He Pro Arg He Lys Asn Ala 

485 490 495 

Cys Leu Gin Thr Ser Ser Leu Ala Val Arg Val Asn Ser Leu Val 

500 505 510 

Cys Leu Gly Lys He Leu Glu Tyr Leu Asp Lys Trp Phe Val Leu 

515 520 525 

Asp Asp He Leu Pro Phe Leu Gin Gin He Pro Ser Lys Glu Pro 

530 535 540 

Ala Val Leu Met Gly He Leu Gly He Tyr Lys Cys Thr Phe Thr 

545 550 555 

His Lys Lys Leu Gly He Thr Lys Glu Gin Leu Ala Gly Lys Val 
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560 



565 



570 



I*eu Pro His Leu lie Pro Leu Ser lie 61u Asn Asn Leu Asn Leu 
575 580 585 

Asn Gin Phe Asn Ser Phe lie Ser Val lie Lys Glu Met Leu Asn 
590 595 600 

Arg Leu Glu Ser Glu His Lys Thr Lys Leu Glu Gin Leu His lie 
605 610 615 

Met Gin Glu Gin Gin Lys Ser Leu Asp He Gly Asn Gin Met Asn 
620 625 630 

Val Ser Glu Glu Met Lys Val Thr Asn He Gly Asn Gin Gin He 
635 640 645 

Asp Lys Val Phe Asn Asn He Gly Ala Asp Leu Leu Thr Gly Ser 
650 655 660 

Glu Ser Glu Asn Lys Glu Asp Gly Leu Gin Asn Lys His Lys Arg 
665 670 675 

Ala Ser Leu Thr Leu Glu Glu Lys Gin Lys Leu Ala Lys Glu Gin 
680 685 690 

Glu Gin Ala Gin Lys Leu Lys Ser Gin Gin Pro Leu Lys Pro Gin 
695 700 705 

Val His Thr Pro Val Ala Thr Val Lys Gin Thr Lys Asp Leu Thr 
710 715 720 

Asp Thr Leu Met Asp Asn Met Ser Ser Leu Thr Ser Leu Ser Val 
725 730 735 

Ser Thr Pro Lys Ser Ser Ala Ser Ser Thr Phe Thr Ser Val Pro 
740 745 750 

Ser Met Gly He Gly Met Met Phe Ser Thr Pro Thr Asp Asn Thr 
755 760 765 

Lys Arg Asn Leu Thr Asn Gly Leu Asn Ala Asn Met Gly Phe Gin 
770 775 780 

Thr Ser Gly Phe Asn Met Pro Val Asn Thr Asn Gin Asn Phe Tyr 
785 790 795 

Ser Ser Pro Ser Thr Val Gly Val Thr Lys Met Thr Leu Gly Thr 
800 805 810 

Pro Pro Thr Leu Pro Asn Phe Asn Ala Leu Ser Val Pro Pro Ala 
815 820 825 

Gly Ala Lys Gin Thr Gin Gin Arg Pro Thr Asp Met Ser Ala Leu 
830 835 840 

Asn Asn Leu Ptie Gly Pro Gin Lys Pro Lys Val Ser Met Asn Gin 
845 850 855 

Leu Ser Gin Gin Lys Pro Asn Gin Trp Leu Asn Gin Phe Val Pro 
860 865 870 

Pro Gin Gly Ser Pro Thr Met Gly Ser Ser Val Met Gly Thr Gin 
875 880 885 

Met Asn Val He Gly Gin Ser Ala Phe Gly Met Gin Gly Asn Pro 
890 895 900 

Phe Phe Asn Pro Gin Asn Phe Ala Gin Pro Pro Thr Thr Met Thr 
905 910 915 

Asn Ser Ser Ser Ala Ser Asn Asp Leu Lys Asp lieu Phe Gly 



<210> 14 
<211> 523 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> miscjeattire 

<223> Incyte ID No: 3838946CD1 

<400> 14 

Met Ala Ala Ala Leu Gin Val Leu Pro Arg Leu Ala Arg Ala Pro 
15 10 15 

Leu His Pro Leu Leu Trp Arg Gly Ser Val Ala Arg Leu Ala Ser 



920 



925 
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20 



25 



30 



Ser Met Ala Leu Ala 61u Gin Ala Arg Gin Leu Phe Glu Ser Ala 
35 40 45 

Val Gly Ala Val Leu Pro Gly Pro Met Leu His Arg Ala Leu Ser 
50 55 60 

Leu Asp Pro Gly Gly Arg Gin Leu Lys Val Arg Asp Arg Asn Phe 
65 70 75. 

Gin Leu Arg Gin Asn Leu Tyr Leu Val Gly Phe Gly Lys Ala Val 
80 85 90 

Leu Gly Met Ala Ala Ala Ala Glu Glu Leu Leu Gly Gin His Leu 
95 100 105 

Val Gin Gly Val He Ser Val Pro Lys Gly He Arg Ala Ala Met 
110 115 120 

Glu Arg Ala Gly Lys Gin Glu Met lieu Leu Lys Pro His Ser Arg 
125 130 135 

Val Gin Val Phe Glu Gly Ala Glu Asp Asn Leu Pro Asp Arg Asp 
140 145 150 

Ala Leu Arg Ala Ala Leu Ala He Gin Gin Leu Ala Glu Gly Leu 
155 160 165 

Thr Ala Asp Asp Leu Leu Leu Val Leu He Ser Gly Gly Gly Ser 
170 175 180 

Ala Leu Leu Pro Ala Pro He Pro Pro Val Thr Leu Glu Glu Lys 
185 190 195 

Gin Thr Leu Thr Arg Leu Leu Ala Ala Arg Gly Ala Thr He Gin 
200 205 210 

Glu Leu Asn Thr He Arg Lys Ala Leu Ser Gin Leu Lys Gly Gly 
215 220 225 

Gly Leu Ala Gin Ala Ala Tyr Pro Ala Gin Val Val Ser Leu He 
230 235 240 

Leu Ser Asp Val Val Gly Asp Pro Val Glu Val He Ala Ser Gly 
245 250 255 

Pro Thr Val Ala Ser Ser His Asn Val Gin Asp Cys Leu His He 
260 265 270 

Leu Asn Arg Tyr Gly Leu Arg Ala Ala Leu Pro Arg Ser Val Lys 
275 280 285 

Thr Val Leu Ser Arg Ala Asp Ser Asp Pro His Gly Pro His Thr 
290 295 300 

Cys Gly His Val Leu Asn Val He He Gly Ser Asn Val Leu Ala 
305 310 315 

Leu Ala Glu Ala Gin Arg Gin Ala Glu Ala Leu Gly Tyr Gin Ala 
320 325 330 

Val Val Leu Ser Ala Ala Met Gin Gly Asp Val Lys Ser Met Ala 
335 340 345 

Gin Phe Tyr Gly Leu Leu Ala His Val Ala Arg Thr Arg Leu Thr 
350 355 360 

Pro Ser Met Ala Gly Ala Ser Val Glu Glu Asp Ala Gin Leu His 
365 370 375 

Glu Leu Ala Ala Glu Leu Gin He Pro Asp Leu Gin Leu Glu Glu 
380 385 390 

Ala Leu Glu Thr Met Ala Trp Gly Arg Gly Pro Val Cys Leu Leu 
395 400 405 

Ala Gly Gly Glu Pro Thr Val Gin Leu Gin Gly Ser Gly Arg Gly 
410 415 420 

Gly Arg Asn Gin Glu Leu Ala Leu Arg Val Gly Ala Glu Leu Arg 
425 430 435 

Arg Trp Pro Leu Gly Pro He Asp Val Leu Phe Leu Ser Gly Gly 
440 445 450 

Thr Asp Gly Gin Asp Gly Pro Thr Glu Ala Ala Gly Ala Trp Val 
455 460 465 

Thr Pro Glu Leu Ala Ser Gin Ala Ala Ala Glu Gly Leu Asp He 
470 475 480 

Ala Thr Phe Leu Ala His Asn Asp Ser His Thr Phe Phe Cys Cys 



485 



490 



495 
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Leu Gin Gly Gly Ala His Leu Leu His Thr Gly Met Thr Gly Thr 
500 505 510 

Asn val Met Asp Thr His Leu Leu Phe Leu Arg Pro Arg 
515 520 

<210> 15 
<211> 459 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> niiscjeature 

<223> Incyte ID No: 72001176CD1 

<400> 15 



Met 


Asp 


His 


Pro Ser Arg Glu 


Lys Asp Glu Arg Gin Arg Thr 


Thr 


1 






5 


10 


15 


Lys 


Pro 


Met 


Ala Gin Arg Ser 


Ala His Cys Ser Arg Pro Ser 


Gly 








20 


25 


30 


Ser 


Ser 


Ser 


Ser Ser Gly Val 


Leu Met Val Gly Pro Asn Phe 


Arg 








35 


40 


45 


Val 


Gly 


Lys 


Lys He Gly Cys 


Gly Asn Phe Gly Glu Leu Arg 


Leu 








50 


55 


60 


Gly 


Lys 


Asn 


Leu Tyr Thr Asn 


Glu Tyr Val Ala He Lys Leu 


Glu 








65 


70 


75 


Pro 


He 


Lys 


Ser Arg Ala Leu 


Gin Leu His Leu Glu Tyr Arg 


Phe 








80 


85 


90 


Tyr 


Lys 


Gin 


Leu Gly Ser Ala 


Gly Glu Gly Leu Pro Gin Val 


Tyr 








95 


100 


105 


Tyr 


Phe 


Gly 


Pro Cys Gly Lys 


Tyr Asn Ala Met Val Leu Glu 


Leu 








110 


115 


120 


Leu 


Gly 


Pro 


Ser Leu Glu Asp 


Leu Phe Asp Leu Cys Asp Arg 


Thr 








125 


130 


135 


Phe 


Thr 


Leu 


Lys Thr Val Leu 


Met He Ala He Gin Leu Leu 


Ser 








140 


145 


150 


Arg 


Met 


Glu 


Tyr Val His Ser 


Lys Asn Leu He Tyr Arg Asp 


Val 








155 


160 


165 


Lys 


Pro 


Glu 


Asn Phe Leu He 


Gly Arg Gin Gly Asn Lys Lys 


Glu 








170 


175 


180 


His 


Val 


He 


His He He Asp 


Phe Gly Leu Ala Lys Glu Tyr 


He 








185 


190 


195 


Asp 


Pro 


Glu 


Thr Lys Lys His 


He Pro Tyr Arg Glu His Lys 


Ser 








200 


205 


210 


Leu 


Thr 


Gly 


Thr Ala Arg Tyr 


Mfet Ser He Asn Thr His Leu 


Gly 








215 


220 


225 


Lys 


Glu 


Gin 


Ser Arg Arg Asp 


Asp Leu Glu Ala Leu Gly His 


Met 








230 


235 


240 


Phe 


Met 


Tyr 


Phe Leu Arg Gly 


Ser Leu Pro Trp Gin Gly Leu 


Lys 








245 


250 


255 


Ala 


Asp 


Thr 


Leu Lys Glu Arg 


Tyr Gin Lys He Gly Asp Thr 


Lys 








260 


265 


270 


Arg 


Asn 


Thr 


Pro He Glu Ala 


Leu Cys Glu Asn Phe Pro Glu 


Glu 








275 


280 


285 


Met 


Ala 


Thr 


Tyr Leu Arg Tyr 


Val Arg Arg Leu Asp Phe Phe 


Glu 








290 


295 


300 


Lys 


Pro 


Asp 


Tyr Glu Tyr Leu 


Arg Thr Leu Phe Thr Asp Leu 


Phe 


Glu 






305 


310 


315 


Lys 


Lys 


Gly Tyr Thr Phe 


Asp Tyr Ala Tyr Asp Trp Val 


Gly 








320 


325 


330 


Arg 


Pro 


He 


Pro Thr Pro Val 


Gly Ser Val His Val Asp Ser 


Gly 








335 


340 


345 


Ala 


Ser 


Ala 


He Thr Arg Glu 


Ser His Thr His Arg Asp Arg 


Pro 








350 


355 


360 
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Ser Gin Gin Gin Pro Leu Arg Asn Gin Asn Val Ser Ser Glu Arg 

365 370 375 

Arg Gly Glu Trp Glu He Gin Pro Ser Arg Gin Thr Asn Thr Ser 

380 385 390 

Tyr Leu Tlir Ser His Leu Ala Ala Asp Arg His Gly Gly Ser Val 

395 400 405 

Gin Val Val Ser Ser Thr Asn Gly Glu Leu Asn Val Asp Asp Pro 

410 415 420 

Thr Gly Ala His Ser Asn Ala* Pro He Thr Ala His Ala Glu Val 

425 430 435 

Glu Val Val Glu Glu Ala Lys Cys Cys Cys Phe Phe Lys Arg Lys 

440 445 450 

Arg Lys Lys Thr Ala Gin Arg His Lys 

455 

<210> 16 

<211> 1360 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 55064363CD1 

<400> 16 



Met 


Lys 


Trp 


Val 


Gly Asp 


Thr 


Gly Val Gly Gly Asn He Pro 


Pro 


1 








5 




10 


15 


Ser 


Phe 


Thr 


Thr 


Pro Gly 


Leu 


Ser Ser Arg Pro Gly Ala Met 


Val 










20 




25 


30 


Ala 


Asp 


Arg 


Ser 


Arg Trp 


Pro 


Leu Ala Gin Gly Lys Gly Ala 


Gin 










35 




40 


45 


Ala 


Gly 


Thr 


Tip 


Arg Ala 


Ala 


Val Glu Cys Ser Gly Arg Gly 


Leu 










50 




55 


60 


Gly 


Taa 


Ala 


Ser 


Glu Ser 


Pro 


Gin Cys Pro Pro Pro Pro Gly 


Val 










65 




70 


75 


Glu 


Gly 


Ala 


Ala 


Gly Pro 


Ala 


Glu Pro Asp Gly Ala Ala Glu 


Gly 










80 




85 


90 


Ala 


Ala 


Gly 


Gly 


Ser Gly 


Glu 


Gly Glu Ser Gly Gly Gly Pro 


Arg 










95 




100 


105 


Arg 


Ala 


Leu 


Arg 


Ala Val 


Tyr 


Val Arg Ser Glu Ser Ser Gin 


Gly 










110 




115 


120 


Gly 


Ala 


Ala 


Gly 


Gly Pro 


Glu 


Ala Gly Ala Arg Gin Cys Leu 


Leu 










125 




130 


135 


Arg 


Ala 


Cys 


Glu 


Ala Glu 


Gly 


Ala His Leu Thr Ser Val Pro 


Phe 










140 




145 


150 


Gly 


Glu 


Leu 


Asp 


Phe Gly 


Glu 


Thr Ala Val Leu Asp Ala Phe 


Tyr 










155 




160 


165 


Asp 


Ala 


Asp 


Val 


Ala Val 


Val 


Asp Met Ser Asp Val Ser Arg 


Gin 










170 




175 


180 


Pro 


Ser 


Leu 


Phe 


Tyr His 


Leu 


Gly Val Arg Glu Ser Phe Asp 


Met 










185 




190 


195 


Ala 


Asn 


Asn 


Val 


He Leu 


Tyr 


His Asp Thr Asp Ala Asp Thr 


Ala 










200 




205 


210 


Leu 


Ser 


Leu 


Lys 


Asp Met 


Val 


Thr Gin Lys Asn Thr Ala Ser 


Ser 










215 




220 


225 


Gly 


Asn 


Tyr 


Tyr 


Phe He 


Pro 


Tyr He Val Thr Pro Cys Thr 


Asp 










230 




235 


240 


Tyr 


Phe 


Cys 


Cys 


Glu Ser 


Asp 


Ala Gin Arg Arg Ala Ser Glu 


Tyr 










245 




250 


255 


Met 


Gin 


Pro 


Asn 


Tzp Asp 


Asn 


He Leu Gly Pro Leu Cys Met 


Pro 










260 




265 


270 


Leu 


val 


Asp 


Arg 


Phe He 


Ser 


Leu Leu Lys Asp He His Val 


Thr 










275 




280 


285 
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Ser Cys Val Tyr Tyr Lys Glu Thr Leu Leu Asn Asp lie Arg Lys 

290 295 300 

Ala Arg Glu Lys Tyr Gin Gly Glu Glu Leu Ala Lys Glu Leu Ala 

305 310 315 

Arg lie Lys Leu Arg Met Asp Asn Thr Glu Val Leu Thr Ser Asp 

320 325 330 

lie lie lie Asn Leu Leu Leu Ser Tyr Arg Asp He Gin Asp Tyr 

335 340 345 

Asp Ala Met Val Lys Leu Val Glu Thr Leu Glu Met Leu Pro Thr 

350 355 360 

Cys Asp Leu Ala Asp Gin His Hisn lie Lys Phe His Tyr Ala Phe 

365 370 375 

Ala Leu Asn Arg Arg Asn Ser Thr Gly Asp Arg Glu Lys Ala Leu 

380 385 390 

Gin He Met Leu Gin Val Leu Gin Ser Cys Asp His Pro Gly Pro 

395 400 405 

Asp Met Phe Cye Leu Cys Gly Arg He Tyr Lys Asp He Phe Leu 

410 415 420 

Asp Ser Asp Cys Lys Asp Asp Thr Ser Arg Asp Ser Ala He Glu 

425 430 435 

Trp Tyr Arg Lys Gly Phe Glu Leu Gin Ser Ser Leu Tyr Ser Gly 

440 445 450 

He Asn Leu Ala Val Leu Leu He Val Ala Gly Gin Gin Phe Glu 

455 460 465 

Thr Ser Leu Glu Leu Arg Lys He Gly Val Arg Leu Asn Ser Leu 

470 475 480 

Leu Gly Arg Lys Gly Ser Leu Glu Lys Met Asn Asn Tyr Tip Asp 

485 490 495 

Val Gly Gin Phe Phe Ser Val Ser Met Leu Ala His T^p Val Gly 

500 505 510 

Lys Ala Val Gin Ala Ala Glu Arg Leu Phe Lys Leu Lys Pro Pro 

515 520 525 

Val Trp Tyr Leu Arg Ser Leu Val Gin Asn Leu Leu Leu He Arg 

530 535 540 

Arg Phe Lys Lys Thr He He Glu His Ser Pro Arg Gin Glu Arg 

545 550 555 

Leu Asn Phe Trp Leu Asp He He Phe Glu Ala Thr Asn Glu Val 

560 565 570 

Thr Asn Gly Leu Arg Phe Pro Val Leu Val He Glu Pro Thr Lys 

575 580 585 

Val Tyr Gin Pro Ser Tyr Val Ser He Asn Asn Glu Ala Glu Glu 

590 595 600 

Arg Thr Val Ser Leu Trp His Val Ser Pro Thr Glu Met Lys Gin 

605 610 615 

Met His Glu Trp Asn Phe Thr Ala Ser Ser He Lys Gly He Ser 

620 625 630 

Leu Ser Lys Phe Asp Glu Arg Cys Cys Phe Leu Tyr Val His Asp 

635 640 645 

Asn Ser Asp Asp Phe Gin He Tyr Phe Ser Thr Glu Glu Gin Cys 

650 655 660 

Ser Arg Phe Phe Ser Leu Val Lys Glu Met He Thr Asn Thr Ala 

665 670 675 

Gly Ser Thr Val Glu Leu Glu Gly Glu Thr Asp Gly Asp Thr Leu 

680 685 690 

Glu Tyr Glu Tyr Asp His Asp Ala Asn Gly Glu Arg Val Val Leu 

695 700 705 

Gly Lys Gly Thr Tyr Gly He Val Tyr Ala Gly Arg Asp Leu Ser 

710 715 720 

Asn Gin Val Arg He Ala He Lys Glu He Pro Glu Arg Asp Ser 

725 730 735 

Arg Tyr Ser Gin Pro Leu His Glu Glu He Ala Leu His Lys Tyr 

740 745 750 

Leu Lys His Arg Asn He Val Gin Tyr Leu Gly Ser Val Ser Glu 
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755 760 765 

Asn Gly Tyr He Lys He Phe Met Glu Gin Val Pro Gly Gly Ser 
770 775 780 

Leu Ser Ala Leu Leu Arg Ser Lys Trp Gly Pro Met Lys Glu Pro 
785 790 795 

Thr He Lys Phe Tyr Thr Lys Gin lie Leu Glu Gly Leu Lys Tyr 
800 805 810 

Leu His Glu Asn Gin lie Val His Arg Asp He Lys Gly Asp Asn 
815 820 825 

Val Leu Val Asn Thr Tyr Ser Gly Val Val Lys He Ser Asp Phe 
830 835 840 

Gly Thr Ser Lys Arg Leu Ala Gly Val Asn Pro Cys Thr Glu Thr 
845 850 855 

Phe Thr Gly Thr Leu Gin Tyr Met Ala Pro Glu He He Asp Gin 
860 865 870 

Gly Pro Arg Gly Tyr Gly Ala Pro Ala Asp He Trp Ser Leu Gly 
875 880 885 

Cys Thr He He Glu Mfet Ala Thr Ser Lys Pro Pro Phe His Glu 
890 895 900 

Leu Gly Glu Pro Gin Ala Ala Met Phe Lys Val Gly Met Phe Lys 
905 • 910 915 

He His Pro Glu He Pro Glu Ala Leu Ser Ala Glu Ala Arg Ala 
920 925 930 

Phe He Leu Ser Cys Phe Glu Pro Asp Pro His Lys Arg Ala Thr 
935 940 945 

Thr Ala Glu Leu Leu Arg Glu Gly Phe Leu Arg Gin Val Asn Lys 
950 955 960 

Gly Lys Lys Asn Arg He Ala Phe Lys Pro Ser Glu Gly Pro Arg 
965 970 975 

Gly Val Val Leu Ala Leu Pro Thr Gin Gly Glu Pro Met Ala Thr 
980 985 990 

Ser Ser Ser Glu His Gly Ser Val Ser Pro Asp Ser Asp Ma Gin 
995 1000 1005 

Pro Asp Ala Leu Phe Glu Arg Thr Arg Ala Pro Arg His His Leu 
1010 1015 1020 

Gly His Leu Leu Ser Val Pro Asp Glu Ser Ser Ala Leu Glu Asp 
1025 1030 1035 

Arg Gly Leu Ala Ser Ser Pro Glu Asp Arg Asp Gin Gly Leu Phe 
1040 1045 1050 

Leu Leu Arg Lys Asp Ser Glu Arg Arg Ala He Leu Tyr Lys He 
1055 1060 1065 

Leu Trp Glu Glu Gin Asn Gin Val Ala Ser Asn Leu Gin Glu Cys 
1070 1075 1080 

Val Ala Gin Ser Ser Glu Glu Leu His Leu Ser Val Gly His He 
1085 1090 1095 

Lys Gin He He Gly He Leu Arg Asp Phe He Arg Ser Pro Glu 
1100 1105 1110 

His Arg Val Met Ala Thr Thr He Ser Lys Leu Lys Val Asp Leu 
1115 1120 1125 

Asp Phe Asp Ser Ser Ser He Ser Gin He His Leu Val Leu Phe 
1130 1135 1140 

Gly Phe Gin Asp Ala Val Asn Lys He Leu Arg Asn His Leu He 
1145 1150 1155 

Arg Pro His Trp Met Phe Ala Met Asp Asn He He Arg Arg Ala 
1160 1165 1170 

Val Gin Ala Ala Val Thr He Leu He Pro Glu Leu Arg Ala His 
1175 1180 1185 

Phe Glu Pro Thr Cys Glu Thr Glu Gly Val Asp Lys Asp Met Asp 
1190 1195 1200 

Glu Ala Glu Glu Gly Tyr Pro Pro Ala Thr Gly Pro Gly Gin Glu 
1205 1210 1215 

Ala Gin Pro His Gin Gin His Leu Ser Leu Gin Leu Gly Glu Leu 
1220 1225 1230 
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Arg 


Gin 


Glu 


Thr Asn Arg 


Leu Leu Glu His Leu 


Val Glu Lys Glu 








1235 


1240 


1245 


Arg 


Glu 


Tyr 


Gin Asn Leu 


Leu Arg Gin Thr Leu 


Glu Gin Lys Thr 








1250 


1255 


1260 


Gin 


Glu 


Leu 


Tyr His Leu 


Gin Leu Lys Leu Lys 


Ser Asn Cys He 








1265 


1270 


1275 


Thr 


Glu 


Tlsn 


Pro Ala Gly 


Pro Tyr Gly Gin Arg 


Thr Asp Lys Glu 








1280 


1285 


1290 


Leu 


He 


Asp 


Tarp Leu Arg 


Leu Gin Gly Ala Asp 


Ala Lys Thr He 








1295 


1300 


1305 


61u 


Lys 


He 


Val Glu Glu 


Gly Tyr Thr Leu Ser 


Asp He Leu Asn 








1310 


1315 


1320 


61u 


He 


Thr 


Lys Glu Asp 


Leu Arg Tyr Leu Asrg 


Leu Arg Gly Gly 








1325 


1330 


1335 


Leu 


Leu 


Cys 


Arg Leu Trp 


Ser Ala Val Ser Gin 


Tyr Arg Arg Ala 








1340 


1345 


1350 


61x1 


Glu 


Ala 


Ser Glu Thr 


Lys Asp Lys Ala 










1355 


1360 





<210> 17 
<211> 1345 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 7482044CD1 

<400> 17 



Met 


Glu 


Pro 


Gly 


Arg Gly 


Ala 


Gly Pro 


Ala Gly Met Ala Glu 


Pro 


1 








5 






10 


15 


Arg 


Ala 


Lys 


Ala 


Ala Arg 


Pro 


Gly Pro 


Gin Arg Phe Leu Arg 


Arg 










20 






25 


30 


Ser 


Val 


Val 


Glu 


Ser Asp 


Gin 


Glu Glu 


Pro Pro Gly Leu Glu 


Ala 










35 






40 


45 


Ala 


Glu 


Ala 


Pro 


Gly Pro 


Gin 


Pro Pro 


Gin Pro Leu Gin Arg 


Arg 










50 






55 


60 


Val 


Leu 


Leu 


Leu 


Cys Lys 


Thr 


Arg Arg 


Leu He Ala Glu Arg 


Ala 










65 






70 


75 


Arg 


Gly 


Arg 


Pro 


Ala Ala 


Pro 


Ala Pro 


Ala Ala Leu Val Ala 


Gin 










80 






85 


90 


Pro 


Gly 


Ala 


Pro 


Gly Ala 


Pro 


Ala Asp 


Ala Gly Pro Glu Pro 


Val 










95 






100 


105 


Gly 


Tlir 


Gin 


Glu 


Pro Gly 


Pro 


Asp Pro 


He Ala Ala Ala Val 


Glu 










110 






115 


120 


Thr 


jaa 


Pro 


Ala 


Pro Asp 


Gly 


Gly Pro 


Arg Glu Glu Ala Ala 


Ala 










125 






130 


135 


Thr 


Val 


Arg 


Lys 


Glu Asp 


Glu 


Gly Ala 


Ala Glu Ala Lys Pro 


Glu 










140 






145 


150 


Pro 


Gly 


Arg 


Thr 


Arg Arg 


Asp 


Glu Pro 


Glu Glu Glu Glu Asp 


Asp 










155 






160 


165 


Glu 


Asp 


Asp 


Leu 


Lys Ala 


Val 


Ala Thr 


Ser Leu Asp Gly Arg 


Phe 










170 






175 


180 


Leu 


Lys 


Phe 


Asp 


He Glu 


Leu 


Gly Arg 


Gly Ser Phe Lys Thr 


Val 










185 






190 


195 


Tyr 


Lys 


Gly 


Leu 


Asp Thr 


Glu 


Thr Trp 


Val Glu Val Ala Trp 


Cys 










200 






205 


210 


Glu 


Leu 


Gin 


Asp 


Arg Lys 


Leu 


Thr Lys 


Leu Glu Arg Gin Arg 


Phe 










215 






220 


225 


Lys 


Glu 


Glu 


Ala 


Glu Met 


Leu 


Lys Gly 


Leu Gin His Pro Asn 


He 










230 






235 


240 


Val 


Arg 


Phe 


Tyr 


Asp Phe 


Trp 


Glu Ser 


Ser Ala Lys Gly Lys 


Arg 










245 






250 


255 
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Cys He Val Leu Val Thr Glu Leu Met Thr Ser Gly Thr Leu Lys 

260 265 270 

Thr Tyr Leu Lys Arg Phe Lys Val Met Lys Pro Lys Val Leu Arg 

275 280 285 

Ser Trp Cys Arg Gin He Leu Lys Gly Leu Leu Phe Leu His Thr 

290 295 300 

Arg Thr Pro Pro He He His Arg Asp Leu Lys Cys Asp Asn He 

305 310 315 

Phe He Thr Gly Pro Thr Gly Ser Val Lys He Gly Asp Leu Gly 

320 325 330 

Leu Ala Thr Leu Lys Arg Ala Ser Phe Ala Lys Ser Val He Gly 

335 340 345 

Thr Pro Glu Phe Met Ala Pro Glu Met Tyr Glu Glu His Tyr Asp 

350 355 360 

Glu Ser Val Asp Val Tyr Ala Phe Gly Met Cys Met Leu Glu Met 

365 370 375 

Ala Thr Ser Glu Tyr Pro Tyr Ser Glu Cys Gin Asn Ala Ala Gin 

380 385 390 

He Tyr Arg Lys Val Thr Cys Gly He Lys Pro Ala Ser Phe Glu 

395 400 405 

Lys Val His Asp Pro Glu He Lys Glu He He Gly Glu Cys He 

410 415 420 

Cys Lys Asn Lys Glu Glu Arg Tyr Glu He Lys Asp Leu Leu Ser 

425 430 435 

His Ala Phe Phe Ala Glu Asp Thr Gly Val Arg Val Glu Leu Ala 

440 445 450 

Glu Glu Asp His Gly Arg Lys Ser Thr He Ala Leu Arg Leu Trp 

455 460 465 

Val Glu Asp Pro Lys Lys Leu Lys Gly Lys Pro Lys Asp Asn Gly 

470 475 480 

Ala He Glu Phe Thr Phe Asp Leu Glu Lys Glu Thr Pro Asp Glu 

485 490 495 

Val Ala Gin Glu Met He Glu Ser Gly Phe Phe His Glu Ser Asp 

500 505 510 

Val Lys He Val Ala Lys Ser He Arg Asp Arg Val Ala Leu He 

515 520 525 

Gin Trp Arg Arg Glu Arg He Trp Pro Ala Leu Gin Pro Lys Glu 

530 535 540 

Gin Gin Asp Val Gly Ser Pro Asp Lys Ala Arg Gly Pro Pro Val 

545 550 555 

Pro Leu Gin Val Gin Val Thr Tyr His Ala Gin Ala Gly Gin Pro 

560 565 570 

Gly Pro Pro Glu Pro Glu Glu Pro Glu Ala Asp Gin His Leu Leu 

575 580 585 

Pro Pro Thr Leu Pro Thr Ser Ala Thr Ser Leu Ala Ser Asp Ser 

590 595 600 

Thr Phe Asp Ser Gly Gin Gly Ser Thr Val Tyr Ser Asp Ser Gin 

605 610 615 

Ser Ser Gin Gin Ser Val Met Leu Gly Ser Leu Ala Asp Ala Ala 

620 625 630 

Pro Ser Pro Ala Gin Cys Val Cys Ser Pro Pro val Ser Glu Gly 

635 640 645 

Pro Val Leu Pro Gin Ser Leu Pro Ser Leu Gly Ala Tyr Gin Gin 

650 655 660 

Pro Thr Ala Ala Pro Gly Leu Pro Val Gly Ser Val Pro Ala Pro 

665 670 675 

Ala Cys Pro Pro Ser Leu Gin Gin His Phe Pro Asp Pro Ala Met 

680 685 690 

Ser Phe Ala Pro Val Leu Pro Pro Pro Ser Thr Pro Met Pro Thr 

695 700 705 

Gly Pro Gly Gin Pro Ala Pro Pro Gly Gin Gin Pro Pro Pro Leu 

710 715 720 

Ala Gin Pro Thr Pro Leu Pro Gin Val Leu Ala Pro Gin Pro Val 
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725 




vai 


Pro 


Leu 


Gin Pro 


Val Pro Pro 








740 




Pro 


Ma 


Ser 


Gin Val 


Gly Ma Pro 








755 


Met 


Pro 


Gin 


Ma Pro 


Leu Gin Pro 








770 




Met 


Pro 


Pro 


lie Pro 


Val Val Pro 








785 




lie 


Asp 


Gly 


Leu Pro 


Pro Ala Leu 








800 




Val 


Pro 


Pro 


Met Pro 


Pro Pro Gin 








815 




Leu 


Pro 


Ser 


Leu Ma 


Ma Pro Leu 








830 




Pro 


Leu 


Gin 


Ala Val 


Lys Leu Pro 








845 




Ma 


Met 


Pro 


Cys Arg 


Thr lie Val 








860 




Pro 


Leu 


Leu 


Ma Val 


Ma Pro Pro 








875 




His 


Ser 


Ma 


Val Ma 


Gin Leu Pro 








890 




Ma 


Phe 


Pro 


Gin Met 


Ma Pro Thr 








905 




His 


Thr 


Val 


Gin Asn 


Met Arg Ma 








920 




Pro 


Pro 


Gin 


Pro Thr 


Leu Pro Pro 








935 




Pro 


Thr 


Leu 


Pro Pro 


Gin Pro Val 








950 




Pro 


Pro 


Gin 


Pro Val 


Leu Pro Pro 








965 




Pro 


Val 


Leu 


Pro Pro 


Gin Pro Ma 








980 




Leu 


Gin 


Pro 


His Leu 


Pro 61u Gin 








995 




Gly 


Ser 


Gin 


lie Leu 


Leu Gly His 








1010 




Val 


Ma 


Ma 


Gin Val 


Pro Thr Val 








1025 




Leu 


Ser 


Pro 


Pro Leu 


Pro Glu Val 








1040 




liBU 


Leu 


Pro 


Gin Phe 


Pro Ser Ser 








1055 




Val 


Gin 


Ser 


Val Pro 


Thr Gin Thr 








1070 




Asn 


Pro 


Pro 


Leu Pro 


Gly Gly Pro 








1085 




Thr 


Val 


Gin 


Leu Thr 


Val Glu Pro 








1100 




Gin 


Asp 


Lys 


Pro Pro 


Gly Leu Pro 








1115 




Gly 


Ser 


Asp 


Val Thr 


Ser Gly Lys 








1130 




Gly 


Ma 


Phe 


Gly Gly 


Gly Arg Leu 








1145 




His 


His 


Arg 


Arg Ser 


Thr Arg Ma 








1160 




Ser 


Arg 


Pro 


Arg Leu 


Thr lie Leu 








1175 




Lys 


Met 


Val 


Glu cys 


Gin Leu Glu 








1190 





730 


735 


His Leu Pro 


Pro Tyr Leu Ma 


745 


750 


Ma Gin Leu Lys Pro Leu Gin 


760 


765 


Leu Ma Gin Val Pro Pro Gin 


775 


780 


Pro lie Thr 


Pro Leu Ma Gly 


790 


795 


Pro Asp Leu 


Pro Thr Ma Thr 


805 


810 


Tyr Phe Ser 


Pro Ma Val He 


820 


825 


Pro Pro Ma 


Ser Pro Ma Leu 


835 


840 


His Pro Pro Gly Ma Pro Leu 


850 


855 


Pro Asn Ma 


Pro Ma Thr He 


865 


870 


Gly Val Ma 


Ma Leu Ser He 


880 


885 


Gly Gin Pro Val Tyr Pro Ma 


895 


900 


Asp Val Pro 


Pro Ser Pro His 


910 


915 


Thr Pro Pro 


Gin Pro Ma Leu 


925 


930 


Gin Pro Val 


Leu Pro Pro Gin 


940 


945 


Leu Pro Pro 


Gin Pro Thr Arg 


955 


960 


Gin Pro Met 


Leu Pro Pro Gin 


970 


975 


Leu Pro Val 


Arg Pro Glu Pro 


985 


990 


Ma Ma Pro 


Ma Ma Thr Pro 


1000 


1005 


Pro Ma Pro 


Tyr Ma Val Asp 


1015 


1020 


Pro Val Pro 


Pro Ma Ma Val 


1030 


1035 


Leu Leu Pro 


Ma Ma Pro Glu 


1045 


1050 


Leu Ma Thr Val Ser Ma Ser 


1060 


1065 


Ma Thr Leu 


Leu Pro Pro Ma 


1075 


1080 


Gly He Ma 


Ser Pro Cys Pro 


1090 


1095 


Val Gin Glu 


Glu Gin Ma Ser 


1105 


1110 


Gin Ser Cys Glu Ser Tyr Gly 


1120 


1125 


Glu Leu Ser Asp Ser Cys Glu 


1135 


1140 


Glu Gly Arg Ma Ma Arg Lys 


1150 


1155 


Arg Ser Arg Gin Glu Arg Ma 


1165 


1170 


Asn Val Cys Asn Thr Gly Asp 


1180 


1185 


Thr His Asn 


His Lys Met Val 


1195 


1200 
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Thr Phe Lys Phe Asp Leu Asp Gly Asp Ala Pro Asp Glu lie Ala 

1205 1210 1215 

Thr Tyr Met Val Glu His Asp Phe lie Leu Gin Ala Glu Arg Glu 

1220 1225 1230 

Thr Phe He Glu Gin Met Lys Asp Val Met Asp Lys Ala Glu Asp 

1235 1240 1245 

Met Leu Ser Glu Asp Thr Asp Ala Asp Arg Gly Ser Asp Pro Gly 

1250 1255 1260 

Thr Ser Pro Pro His Leu Ser Thr Cys Gly Leu Gly Thr Gly Glu 

1265 1270 1275 

Glu Ser Arg Gin Ser Gin Ala Asn Ala Pro Val Tyr Gin Gin Asn 

1280 1285 1290 

Val Leu His Thr Gly Lys Arg Trp Phe He He Cys Pro Val Ala 

1295 1300 1305 

Glu His Pro Ala Pro Glu Ala Pro Glu Ser Ser Pro Pro Leu Pro 

1310 1315 1320 

Leu Ser Ser Leu Pro Cys Pro Ala Leu Phe Arg Met Ser Cys Ala 

1325 1330 1335 

Ser Val Leu Ala Qys Pro Leu Ser Ala Cys 

1340 1345 

<210> 18 

<211> 2038 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7476595CD1 

<400> 18 

Met Thr Ala Glu Hur Pro Glu Thr Asp Glu Ser Val Ser Ser Ser 
15 10 15 

Asn Ala Ser Leu lys Leu Arg Arg Lys Pro Arg Glu Ser Asp Phe 
20 25 30 

Glu Thr He Lys Leu He Ser Asn Gly Ala Tyr Gly Ala Val Tyr 
35 40 45 

Phe Val Arg His Lys Glu Ser Arg Gin Arg Phe Ala Met Lys Lys 
50 55 60 

He Asn Lys Gin Asn Leu He Leu Arg Asn Gin He Gin Gin Ala 
65 70 75 

Phe Val Glu Arg Asp He Leu Thr Phe Ala Glu Asn Pro Phe Val 
80 85 90 

Val Ser Met Tyr Cys Ser Phe Glu Thr Arg Arg His Leu Cys Met 
95 100 105 

Val Met Glu Tyr Val Glu Gly Gly Asp Cys Ala Thr Leu Met Lys 
110 115 120 

Asn Met Gly Pro Leu Pro Val Asp Met Ala Arg Met Tyr Phe Ala 
125 130 135 

Glu Thr Val Leu Ala Leu Glu Tyr Leu His Asn Tyr Gly He Val 
140 145 150 

His Arg Asp Leu Lys Pro Asp Asn Leu Leu Val Thr Ser Met Gly 
155 160 165 

His He Lys Leu Thr Asp Phe Gly Leu Ser Lys Val Gly Leu Met 
170 175 180 

Ser Met Thr Thr Asn Leu Tyr Glu Gly His He Glu Lys Asp Ala 
185 190 195 

Arg Glu phe Leu Asp Lys Gin Val Cys Gly Thr Pro Glu Tyr He 
200 205 210 

Ala Pro Glu Val He Leu Arg Gin Gly Tyr Gly Lys Pro Val Asp 
215 220 225 

Trp Trp Ala Met Gly He He Leu Tyr Glu Phe Leu Val Gly Cys 
230 235 240 
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Val 


Pro 


Phe Phe 


lie 


Ser 


Asp Glu 


Pro 


Asp 


Ala Gin 


lieu 


Glu 


Arg Leu 


Arg 


Phe 


Phe Arg 


Ala 


Glu 


Phe lie 


Phe 


Asp 


Thr Arg 


Glu 


Asp 


Asp Thr 


Phe 


Ser 


Ser Cys 


Tyr 


Leu 


Asp Ala 


Trp 


Glu 


Trp His 


Gin 


Ser 


Val Leu 


Arg 


Pro 


Gly Leu 


Pro 


Lvs 


His Cys 


His 


Lvs 


Leu Ser 


Glu 


Gly 


Glu Gin 


Glu 


Glu 


Pro Gly 


Glu 


Glu 


Pro Glu 


Thr 


Leu 


Ser Ast) 


Ser 


Pro 


His Ser 


Ser 


Pro 


Ser Arg 


Pro 


lie 


Val He 


Arg 


Ala 


He Arg 


His 


His 


He Val 


Ala 


Gly 


Leu Lys 


Pro 


Val 


His Gly 


Lys 


Ser 


Gly Asn 


Thr 


Ser 


Xle Lys 


Arg 


Hec 


Val Arg 


Glu 


Arg 


Arg Arg 


Pro 


Leu 


Leu His 


Leu 


Ser 


Ser Gly 



Gly Asp Thr Pro Glu Glu Leu Phe Gly Gin Val 
245 250 255 

He Asn Trp Pro Glu Lys Asp Glu Ala Pro Pro 
260 265 270 

Asp Leu He Thr Leu Leu Leu Arg Gin Asn Pro 
275 280 285 

Gly Thr Gly Gly Ala T?yr Glu Val Lys Gin His 
290 295 300 

Ser Leu Asp Trp Asn Ser Leu Leu Arg Gin Lys 
305 310 315 

Pro Gin Leu Glu Ser Glu Asp Asp Thr Ser Tyr 
320 325 330 

Ser Glu Lys Tyr His His Met Glu Thr Glu Glu 
335 340 345 

Asn Asp Glu Asp Phe Asn Val Glu He Arg Gin 
350 355 360 

Ser His Arg Phe Ser Lys Leu Phe Leu Asn Asp 
365 370 375 

Pro Ala Asn Gly Pro Ala Leu Pro Ser Cys Val 
380 385 390 

Arg Gly Lys Asp Phe Pro Gly Glu Gly Gly Ser 
395 400 405 

Glu Pro Gly Gin Lys Leu Ala Lys Cys Gly Leu 
410 415 420 

Phe Ser Gly Pro Ser Lys Thr Thr Met Pro Thr 
425 430 435 

Phe Leu Leu Cys Leu Asp Thr Glu Ser Asn Arg 
440 445 450 

Ser Gly Leu Leu Pro Lys Leu Ala He Ser Thr 
455 460 465 

Asp Glu Ala Ala Ser Cys Pro Gly Asp Pro His 
470 475 480 

Lys Pro Ala Leu Pro Pro Glu Glu Cys Ala Gin 
485 490 495 

Val Thr Thr Pro Ala Ser Thr He Ser Ser Ser 
500 505 510 

Met Phe Ala Val Ser Pro Leu Gly Ser Pro Met 
515 520 525 

Leu Ser Ser Asp Pro Ser Ser Ser Arg Asp Ser 
530 535 540 

Asp Ser Ser Ala Ala Ser M.a Ser Pro His Gin 
545 550 555 

His Ser Ser Gly Lys Asn Tyr Gly Phe Thr He 
560 565 570 

Val Tyr Val Gly Asp Ser Asp He Tyr Thr Val 
575 580 585 

Trp Asn Val Glu Glu Gly Ser Pro Ala Cys Gin 
590 595 600 

Ala Gly Asp Leu He Thr His He Asn Gly Glu 
605 610 615 

Leu Val His Thr Glu Val He Glu Leu Leu Leu 
620 625 630 

Lys Val Ser He Thr Thr Thr Pro Phe Glu Asn 
635 640 645 

Thr Gly Pro Ala Arg Arg Asn Ser Tyr Lys Ser 
650 655 660 

Arg Ser Lys Lys Ser Lys Lys Lys Glu Ser Leu 
665 670 675 

Ser Leu Phe Lys Lys Leu Ala Lys Gin Pro Ser 
680 685 690 

Ifhr Ser Arg Ser Phe Ser Cys Leu Asn Arg Ser 
695 700 705 

Glu Ser Leu Pro Gly Ser Pro Thr His Ser Leu 
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710 715 720 

Ser Pro Arg Ser Pro Thr Pro Ser Tyr Arg Ser Thr Pro Asp Phe 
725 730 735 

Pro Ser Gly Thr Asn Ser Ser Gin Ser Ser Ser Pro Ser Ser Ser 
740 745 750 

Ala Pro Asn Ser Pro Ala Gly Ser Gly His He Arg Pro Ser Thr 
755 760 765 

Leu His Gly Leu Ala Pro Lys Leu Gly Gly Gin Arg Tyr Arg Ser 
770 775 780 

Gly Arg Arg Lys Ser Ala Gly Asn He Pro Leu Ser Pro Leu Ala 
785 790 795 

Arg Thr Pro Ser Pro Thr Pro Gin Pro Thr Ser Pro Gin Arg Ser 
800 805 810 

Pro Ser Pro Leu Leu Gly His Ser Leu Gly* Asn Ser Lys He Ala 
815 820 825 

Gin Ala Phe Pro Ser Lys Met His Ser Pro Pro Thr He Val Arg 
830 835 840 

His He Val Arg Pro Lys Ser Ala Glu Pro Pro Arg Ser Pro Leu 
845 850 855 

Leu Lys Arg Val Gin Ser Glu Glu Lys Leu Ser Pro Ser Tyr Gly 
860 865 870 

Ser Asp Lys Lys His Leu Cys Ser Arg Lys His Ser Leu Glu Val 
875 880 885 

Thr Gin Glu Glu Val Gin Arg Glu Gin Ser Gin Arg Glu Ala Pro 
890 895 900 

Leu Gin Ser Leu Asp Glu Asn Val cys Asp Val Pro Pro Leu Ser 
905 910 915 

Arg Ala Arg Pro Val Glu Gin Gly Cys Leu Lys Arg Pro Val Ser 
920 925 930 

Arg Lys Val Gly Arg Gin Glu Ser Val Asp Asp Leu Asp Arg Asp 
935 940 945 

Lys Leu Lys Ala Lys Val Val Val Lys Lys Ala Asp Gly Phe Pro 
950 955 . 960 

Glu Lys Gin Glu Ser His Gin Lys Ser His Gly Pro Gly Ser Asp 
965 970 975 

Leu Glu Asn Phe Ala Leu Phe Lys Leu Glu Glu Arg Glu Lys Lys 
980 985 990 

Val Tyr Pro Lys Ala Val Glu Arg Ser Ser Thr Phe Glu Asn Lys 
995 1000 1005 

Ala Ser Met Gin Glu Ala Pro Pro Leu Gly Ser Leu Leu Lys Asp 
1010 1015 1020 

Ala Leu His Lys 61n Ala Ser Val Arg Ala Ser Glu Gly Ala Met 
1025 1030 1035 

Ser Asp Gly Arg Val Pro Ala Glu His Arg Gin Gly Gly Gly Asp 
1040 1045 1050 

Phe Arg Arg Ala Pro Ala Pro Gly Thr Leu Gin Asp Gly Leu Cys 
1055 1060 1065 

His Ser Leu Asp Arg Gly He Ser Gly Lys Gly Glu Gly Thr Glu 
1070 1075 1080 

Lys Ser Ser Gin Ala Lys Glu Leu Leu Arg Cys Glu Lys Leu Asp 
1085 1090 1095 

Ser Lys Leu Ala Asn He Asp Tyr Leu Arg Lys Lys Met Ser Leu 
1100 1105 1110 

Glu Asp Lys Glu Asp Asn Leu Cys Pro Val Leu Lys Pro Lys Met 
1115 1120 1125 

Thr Ala Gly Ser His Glu Cys Leu Pro Gly Asn Pro Val Arg Pro 
1130 1135 1140 

Thr Gly Gly Gin Gin Glu Pro Pro Pro Ala Ser Glu Ser Arg Ala 
1145 1150 1155 

Phe Val Ser Ser Thr His Ala Ala Gin Met Ser Ala Val Ser Phe 
1160 1165 1170 

Val Pro Leu Lys Ala Leu Thr Gly Arg Val Asp Ser Gly Thr Glu 
1175 1180 1185 
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Lys Pro Gly Leu Val Ala Pro Glu Ser Pro Val Arg Lys Ser Pro 

1190 1195 1200 

Ser Glu Tyr Lys Leu Glu Gly Arg Ser Val Ser Cys Leu Lys Pro 

1205 1210 1215 

He Glu Gly Thr Leu Asp He Ala Leu Leu Ser Gly Pro Gin Ala 

1220 1225 1230 

ser Lys Thr Glu Leu Pro Ser Pro Glu Ser Ala Gin Ser Pro Ser 

1235 1240 1245 

Pro Ser Gly Asp Val Arg Ala Ser Val Pro Pro Val Leu Pro Ser 

1250 1255 1260 

Ser Ser Gly Lys Lys Asn Asp Thr Thr Ser Ala Arg Glu Leu Ser 

1265 1270 1275 

Pro Ser Ser Leu Lys Met Asn Lys Ser Tyr Leu Leu Glu Pro Trp 

1280 1285 1290 

Phe Leu Pro Pro Ser Arg Gly Leu Gin Asn Ser Pro Ala Val Ser 

1295 1300 1305 

Leu Pro Asp Pro Glu Phe Lys Arg Asp Arg Lys Gly Pro His Pro 

1310 1315 1320 

Thr Ala Arg Ser Pro Gly Thr Val Met Glu Ser Asn Pro Gin Gin 

1325 1330 1335 

Arg Glu Gly Ser Ser Pro Lys His Gin Asp His Thr Thr Asp Pro 

1340 1345 1350 

Lys Leu Leu Thr Cys Leu Gly Gin Asn Leu His Ser Pro Asp Leu 

1355 1360 1365 

Ala Arg Pro Arg Cys Pro Leu Pro Pro Glu Ala Ser Pro Ser Arg 

1370 1375 1380 

Glu Lys Pro Gly Leu Arg Glu Ser Ser Glu Arg Gly Pro Pro Thr 

1385 1390 1395 

Ala Arg Ser Glu Arg Ser Ala Ala Arg Ala Asp Thr Cys Arg Glu 

1400 1405 1410 

Pro Ser Met Glu Leu Cys Phe Pro Glu Thr Ala Lys Thr Ser Asp 

1415 1420 1425 

Asn Ser Lys Asn Leu Leu Ser Val Gly Arg Thr His Pro Asp Phe 

1430 1435 1440 

Tyr Thr Gin Thr Gin Ala Met Glu Lys Ala Tip Ala Pro Gly Gly 

1445 1450 1455 

Lys Thr Asn His Lys Asp Gly Pro Gly Glu Ala Arg Pro Pro Pro 

1460 1465 1470 

Arg Asp Asn Ser Ser Leu His Ser Ala Gly He Pro Qys Glu Lys 

1475 1480 1485 

Glu Leu Gly Lys Val Arg Aarg Gly Val Glu Pro Lys Pro Glu Ala 

1490 1495 1500 

Leu Leu Ala Arg Arg Ser Leu Gin Pro Pro Gly He Glu Ser Glu 

1505 1510 1515 

Lys Ser Glu Lys Leu Ser Ser Phe Pro Ser Leu Gin Lys Asp Gly 

1520 1525 1530 

Ala Lys Glu Pro Glu Arg Lys Glu Gin Pro Leu Gin Arg His Pro 

1535 1540 1545 

Ser Ser He Pro Pro Pro Pro Leu Thr Ala Lys Asp Leu Ser Ser 

1550 1555 1560 

Pro Ala Ala Arg Gin His Cys Ser Ser Pro Ser His Ala Ser Gly 

1565 1570 1575 

Arg Glu Pro Gly Ala Lys Pro Ser Thr Ala Glu Pro Ser Ser Ser 

1580 1585 1590 

Pro Gin Asp Pro Pro Lye Pro Val Ala Ala His Ser Glu Ser Ser 

1595 1600 1605 

Ser His Lys Pro Arg Pro Gly Pro Asp Pro Gly Pro Pro Lys Thr 

1610 1615 1620 

Lys His Pro Asp Arg Ser Leu Ser Ser Gin Lys Pro Ser Val Gly 

1625 1630 1635 

Ala Thr Lys Gly Lys Glu Pro Ala Thr Gin Ser Leu Gly Gly Ser 

1640 1645 1650 

Ser Arg Glu Gly Lys Gly His Ser Lys Ser Gly Pro Asp Val Phe 
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1655 



1660 



1665 



Pro Ala Thr Pro Gly Ser Gin Asn Lys Ala Ser Asp Gly He Gly 
1670 1675 1680 

Gin Gly Glu Gly Gly Pro Ser Val Pro Leu His Thr Asp Arg Ala 
1685 1690 1695 

Pro Leu Asp Ala Lys Pro Gin Pro Thr Ser Gly Gly Arg Pro Leu 
1700 1705 1710 

Glu Val Leu Glu Lys Pro Val His Leu Pro Arg Pro Gly His Pro 
1715 1720 1725 

Gly Pro Ser Glu Pro Ala Asp Gin Lys Leu Ser Ala Val Gly Glu 
1730 1735 1740 

Lys Gin Thr Leu Ser Pro Lys His Pro Lys Pro Ser Thr Val Lys 
1745 1750 1755 

Asp Cys Pro Thr Leu Cys Lys Gin Thr Asp Asn Arg Gin Thr Asp 
1760 1765 1770 

Lys Ser Pro Ser Gin Pro Ala Ala Asn Thr Asp Arg Arg Ala Glu 
1775 1780 1785 

Gly Lys Lys Cys Thr Glu Ala Leu T^rr Ala Pro Ala Glu Gly Asp 
1790 1795 1800 

Lys Leu Glu Ala Gly Leu Ser - Phe Val His Ser Glu Asn Arg Leu 
1805 1810 1815 

Lys Gly Ala Glu Arg Pro Ala Ala Gly Val Gly Lys Gly Phe Pro 
1820 1825 1830 

Glu Ala Arg Gly Lys Gly Pro Gly Pro Gin Lys Pro Pro Thr Glu 
1835 1840 1845 

Ala Asp Lys Pro Asn Gly Met Lys Arg Ser Pro Ser Ala Thr Gly 
1850 1855 1860 

Gin Ser Ser Phe Arg Ser Thr Ala Leu Pro Glu Lys Ser Leu Ser 
1865 1870 I 1875 

Cys Ser Ser Ser Phe Pro Glu Thr Arg Ala Gly Val Arg Glu Ala 
1880 1885 1890 

Ser Ala Ala Ser Ser Asp Thr Ser Ser Ala Lys Ala Ala Gly Gly 
1895 1900 1905 

Met Leu Glu Leu Pro Ala Pro Ser Asn Arg Asp His Arg Lys Ala 
1910 1915 1920 

Gin Pro Ala Gly Glu Gly Arg Thr His Met Thr Lys Ser Asp Ser 
1925 1930 1935 

Leu Pro Ser Phe Arg Val Ser Thr Leu Pro Leu Glu Ser His His 
1940 1945 1950 

Pro Asp Pro Asn Thr Met Gly Gly Ala Ser His Arg Asp Arg Ala 
1955 I960 1965 

Leu Ser Val Thr Ala Thr Val Gly Glu Thr Lys Gly Lys Asp Pro 
1970 1975 1980 

Ala Pro Ala Gin Pro Pro Pro Ala Arg Lys Gin Asn Val Gly Arg 
1985 1990 1995 

Asp Val Thr Lys Pro Ser Pro Ala Pro Asn Thr Asp Arg Pro He 
2000 2005 2010 

Ser Leu Ser Asn Glu Lys Asp Phe Val Val Arg Gin Arg Arg Gly 
2015 2020 2025 

Lys Glu Ser Leu Arg Ser Ser Pro His Lys Lys Ala Leu 



<210> 19 

<211> 1770 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte IB No: 718243 82CD1 

<400> 19 

Met Ser Gly Glu Val Arg Leu Arg Gin Leu Glu Gin Phe He Leu 



2030 



2035 
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1 



5 



10 



15 



Asp Gly Pro Ala Gin Thr Asn Gly Gin Cys Phe Ser Val Glu Thr 
20 25 30 

Leu Leu Asp lie Leu lie Cys Leu Tyr Asp Glu Cys Asn Asn Ser 
35 40 45 

Pro Leu Arg Arg Glu Lys Asn lie Leu Glu Tyr Leu Glu Trp Ala 
50 55 60 

Lys Pro Phe Thr Ser Lys Val Lys Gin Met Arg Leu His Arg Glu 
65 70 75 

Asp Phe Glu lie Leu Lys Val He Gly Arg Gly Ala Phe Gly Glu 
80 85 90 

Val Ala Val Val Lys Leu Lys Asn Ala Asp Lys Val Phe Ala Met 
95 100 105 

Lys He Leu Asn Lys Trp Glu Met Leu Lys Arg Ala Glu Thr Ala 
110 115 120 

Cys Phe Arg Glu Glu Arg Asp Val Leu Val Asn Gly Asp Asn Lys 
125 130 135 

Trp He Thr Thr Leu His Tyr Ala Phe Gin Asp Asp Asn Asn Leu 
140 145 150 

Tyr Leu Val Met Asp Tyr Tyr Val Gly Gly Asp Leu Leu Thr Leu 
155 160 165 

Leu Ser Lys Phe Glu Asp Arg Leu Pro Glu Asp Met Ala Arg Phe 
170 175 180 

Tyr Leu Ala Glu Met Val He Ala He Asp Ser Val His Gin Leu 
185 190 195 

His Tyr Val His Arg Asp He Lys Pro Asp Asn He Leu Met Asp 
200 205 210 

Met Asn Gly His He Arg Leu Ala Asp Phe Gly Ser Cys Leu Lys 
215 220 225 

Leu Mfet Glu Asp Gly Thr Val Gin Ser Ser Val Ala Val Gly Thr 
230 235 240 

Pro Asp Tyr He Ser Pro Glu He Leu Gin Ala Met Glu Asp Gly 
245 250 255 

Lys Gly Arg Tyr Gly Pro Glu Cys Asp Trp Trp Ser Leu Gly Val 
260 265 270 

Cys Met Tyr Glu Met Leu Tyr Gly Glu Thr Pro Phe Tyr Ala Glu 
275 280 285 

Ser Leu Val Glu Thr Tyr Gly Lys He Met Asn His Lys Glu Arg 
290 295 300 

Phe Gin Phe Pro Ala Gin Val Thr Asp Val Ser Glu Asn Ala Lys 
305 310 315 

Asp Leu He Arg Arg Leu He Cys Ser Arg Glu His Arg Leu Gly 
320 325 330 

Gin Asn Gly He Glu Asp Phe Lys Lys His Pro Phe Phe Ser Gly 
335 340 345 

He Asp Trp Asp Asn He Arg Asn cys Glu Ala Pro Tyr He Pro 
350 355 360 

Glu Val Ser Ser Pro Thr Asp Thr Ser Asn Phe Asp Val Asp Asp 
365 370 375 

Asp Cys Leu Lys Asn Ser Glu Thr Met Pro Pro Pro Thr His Thr 
380 385 390 

Ala Phe Ser Gly His His Leu I>ro Phe Val Gly Phe Thr Tyr Thr 
395 400 405 

Ser Ser Cys Val Leu Ser Asp Arg Ser Cys Leu Arg Val Thr Ala 
410 415 420 

Gly Pro Thr Ser Leu Asp Leu Asp Val Asn Val Gin Arg Thr Leu 
425 430 435 

Asp Asn Asn Leu Ala Thr Glu Ala Tyr Glu Arg Arg He Lys Arg 
440 445 450 

Leu Glu Gin Glu Lys Leu Glu Leu Ser Arg Lys Leu Gin Glu Ser 
455 460 465 

Thr Gin Thr Val Gin Ala Leu Gin Tyr Ser Thr Val Asp Gly Pro 



470 



475 



480 
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Leu Thr Ala Ser Lys Asp Leu Glu lie Lys Asn Leu Lys Glu Glu 

485 490 495 

He Glu Lys Leu Arg Lys Gin Val Thr Glu Ser Ser His Leu Glu 

500 505 510 

Gin Gin Leu Glu Glu Ala Asn Ala Val Arg Gin Glu Leu Asp Asp 

515 520 525 

Ala Phe Arg Gin He Lys Ala Tyr Glu Lys Gin He Lys Thr Leu 

530 535 540 

Gin Gin Glu Arg Glu Asp Leu Asn Lys Glu Leu Val Gin Ala Ser 

545 550 555 

Glu Arg Leu Lys Asn Gin Ser Lys Glu Leu Lys Asp Ala His Cys 

560 565 570 

Gin Arg Lys Leu Ala Met Gin Glu Phe Met Glu He Asn Glu Arg 

575 580 585 

Leu Thr Glu Leu His Thr Gin Lys Gin Lys Leu Ala Arg His Val 

590 595 600 

Arg Asp Lys Glu Glu Glu Val Asp Leu Val Met Gin Lys Val Glu 

605 610 615 

Ser Leu Arg Gin Glu Leu Arg Arg Thr Glu Arg Ala Lys Lys Glu 

620 625 630 

Leu Glu Val His Thr Glu Ala Leu Ala Ala Glu Ala Ser Lys Asp 

635 640 645 

Arg Lys Leu Arg Glu Gin Ser Glu His Tyr Ser Lys Gin Leu Glu 

650 655 660 

Asn Glu Leu Glu Gly Leu Lys Gin Lys Gin He Ser Tyr Ser Pro 

665 670 675 

Gly Val Cys Ser He Glu His Gin Gin Glu He Thr Lys Leu Lys 

680 685 690 

Thr Asp Leu Glu Lys Lys Ser He Phe Tyr Glu Glu Glu Leu Ser 

695 700 705 

Lys Arg Glu Gly He His Ala Asn Glu He Lys Asn Leu Lys Lys 

710 715 720 

Glu Leu His Asp Ser Glu Gly Gin Gin Leu Ala Leu Asn Lys Glu 

725 730 735 

He Met He Leu Lys Asp Lys Leu Glu Lys Thr Arg Arg Glu Ser 

740 745 750 

Gin Ser Glu Arg Glu Glu Phe Glu Ser Glu Phe Lys Gin Gin Tyr 

755 760 765 

Glu Arg Glu Lys Val Leu Leu Thr Glu Glu Asn Lys Lys Leu Thr 

770 775 780 

Ser Glu Leu Asp Lys Leu Thr Thr Leu Tyr Glu Asn Leu Ser He 

785 790 795 

His Asn Gin Gin Leu Glu Glu Glu Val Lys Asp Leu Ala Asp Lys 

800 805 810 

Lys Glu Ser Val Ala His Trp Glu Ala Gin He Thr Glu He He 

815 820 825 

Gin Trp Val Ser Asp Glu Lys Asp Ala Arg Gly Tyr Leu Gin Ala 

830 835 840 

Leu Ala Ser Lys Met Thr Glu Glu Leu Glu Ala Leu Arg Asn Ser 

845 850 855 

Ser Leu Gly Thr Arg Ala Thr Asp Met Pro Trp Lys Met Arg Arg 

860 865 870 

Phe Ala Lys Leu Asp Met Ser Ala Arg Leu Glu Leu Gin Ser Ala 

875 880 885 

Leu Asp Ala Glu He Arg Ala Lys Gin Ala He Gin Glu Glu Leu 

890 895 900 

Asn Lys Val Lys Ala Ser Asn He He Thr Glu Cys Lys Leu Lys 

905 910 915 

Asp Ser Glu Lys Lys Asn Leu Glu Leu Leu Ser Glu He Glu Gin 

920 925 930 

Leu He Lys Asp Thr Glu Glu Leu Arg Ser Glu Lys Gly He Glu 

935 940 945 

His Gin Asp Ser Gin His Ser Phe Leu Ala Phe Leu Asn Thr Pro 
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950 



955 



960 



Thr Asp Ala Leu Asp Gin Phe Glu Asp Ser Phe Ser Ser Ser Ser 
965 970 975 

Ser Ser Leu He Asp Phe Leu Asp Asp Thr Asp Pro Val Glu Asn 
980 985 990 

Thr Tyr Val Trp Asn Pro Ser Val Lys Phe His He Gin Ser Airg 
995 1000 1005 

Ser Thr Ser Pro Ser Thr Ser Ser Glu Ala Glu Pro Val Lys Thr 
1010 1015 1020 

Val Asp Ser Thr Pro Leu Ser Val His Thr Pro Thr Leu Arg Lys 
1025 1030 . 1035 

Lys Gly Cys Pro Gly Ser Thr Gly Phe Pro Pro Lys Arg Lys Thr 
1040 1045 1050 

His Gin Phe Phe Val Lys Ser Phe Thr Thr Pro Thr Lys Cys His 
1055 1060 1065 

Gin Cys Thr Ser Leu Met Val Gly Leu He Arg Gin Gly Cys Ser 
1070 1075 1080 

Cys Glu Val Cys Gly Phe Ser Cys His He Thr Cys Val Asn Lys 
1085 1090 1095 

Ala Pro Thr Thr Cys Pro Val Pro Pro Glu Gin Thr Lys Gly Pro 
1100 1105 * 1110 

Leu Gly He Asp Pro Gin Lys Gly He Gly Thr Ala Tyr Glu Gly 
1115 • 1120 1125 

His Val Arg He Pro Lys Pro Ala Gly Val Lys Lys Gly Trp Gin 
1130 1135 1140 

Arg Ala Leu Ala He Val Cys Asp Phe Lys Leu Phe Leu Tyr Asp 
1145 1150 1155 

He Ala Glu Gly Lys Ala Ser Gin Pro Ser val Val He Ser Gin 
1160 1165 1170 

Val He Asp Met Arg Asp Glu Glu Phe Ser Val Ser Ser Val Leu 
1175 1180 1185 

Ala Ser Asp Val He His Ala Ser Arg Lys Asp He Pro Cys He 
1190 1195 1200 

Phe Arg Val Thr Ala Ser Gin Leu Ser Ala Ser Asn Asn Lys Cys 
1205 1210 1215 

Ser He Leu Met Leu Ala Asp Thr Glu Asn Glu Lys Asn Lys Trp 
1220 1225 1230 

Val Gly Val Leu Ser Glu Leu His Lys He Leu Lys Lys Asn Lys 
1235 1240 1245 

Phe Arg Asp Arg Ser Val Tyr Val Pro Lys Glu Ala Tyr Asp Ser 
1250 1255 1260 

Thr Leu Pro Leu He Lys Thr Thr Gin Ala Ala Ala He He Asp 
1265 1270 1275 

His Glu Arg He Ala Leu Gly Asn Glu Glu Gly Leu Phe Val Val 
1280 1285 1290 

His Val Thr Lys Asp Glu He He Arg Val Gly Asp Asn Lys Lys 
1295 1300 1305 

He His Gin He Glu Leu He Pro Asn Asp Gin Leu Val Ala Val 
1310 1315 1320 

He Ser Gly Arg Asn Arg His Val Arg Leu Phe Pro Met Ser Ala 
1325 1330 . 1335 

Leu Asp Gly Arg Glu Thr Asp Phe Tyr Lys Leu Ser Glu Thr Lys 
1340 1345 1350 

Gly Cys Gin Thr Val Thr Ser Gly Lys Val Arg His Gly Ala Leu 
1355 1360 1365 

Thr Cys Leu Cys Val Ala Met Lys Arg Gin Val Leu Cys Tyr Glu 
1370 1375 1380 

Leu Phe Gin Ser Lys Thr Arg His Arg Lys Phe Lys Glu He Gin 
1385 1390 1395 

Val Pro Tyr Asn Val Gin Trp Met Ala He Phe Ser Glu Gin Leu 
1400 1405 1410 

Cys Val Gly Phe Gin Ser Gly Phe Leu Arg Tyr Pro Leu Asn Gly 



1415 



1420 



1425 
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Glu Gly Asn Pro Tyr Ser Met Leu His Ser Asn Asp His Thr Leu 

1430 1435 1440 

Ser Phe lie Ala His Gin Pro Met Asp Ala He Cys Ala Val Glu 

1445 1450 1455 

He Ser Ser Lys Glu Tyr Leu Leu Cys Phe Asn Ser He Gly He 

1460 1465 1470 

Tyr Thr Asp Cys Gin Gly Arg Arg Ser Arg Gin Gin Glu Leu Met 

1475 1480 1485 

Trp Pro Ala Asn Pro Ser Ser Cys Cys Tyr Asn Ala Pro Tyr Leu 

1490 1495 1500 

Ser Val Tyr Ser Glu Asn Ala Val Asp He Phe Asp Val Asn Ser 

1505 1510 1515 

Met Glu Trp He Gin Thr Leu Pro Leu Lys Lys Val Arg Pro Leu 

1520 1525 1530 

Asn Asn Glu Gly Ser Leu Asn Leu Leu Gly Leu Glu Thr He Arg 

1535 1540 1545 

Leu He Tyr Phe Lys Asn Lys Met JQa Glu Gly Asp Glu Leu Val 

1550 1555 1560 

Val Pro Glu Thr Ser Asp Asn Ser Arg Lys Gin Met Val Arg Asn 

1565 1570 1575 

He Asn Asn Lys Arg Arg Tyr Ser Phe Arg Val Pro Glu Glu Glu 

1580 1585 1590 

Arg Met Gin Gin Arg Arg Glu Met Leu Arg Asp Pro Glu Met Arg 

1595 ' 1600 1605 

Asn Lys Leu He Ser Asn Pro Thr Asn Phe Asn His He Ala His 

1610 1615 1620 

Met Gly Pro Gly Asp Gly He Gin He Leu Lys Asp Leu Pro Met 

1625 1630 1635 

Asn Pro Arg Pro Gin Glu Ser Arg Thr Val Phe Ser Gly Ser Val 

1640 1645 1650 

Ser He Pro Ser He Thr Lys Ser Arg Pro Glu Pro Gly Arg Ser 

1655 1660 1665 

Met Ser Ala Ser Ser Gly Leu Ser Ala Arg Ser Ser Ala Gin Asn 

1670 1675 1680 

Gly Ser Ala Leu Lys Arg Glu Phe Ser Gly Gly Ser Tyr Ser Ala 

1685 1690 1695 

Lys Arg Gin Pro Met Pro Ser Pro Ser Glu Gly Ser Leu Ser Ser 

1700 1705 1710 

Gly Gly Met Asp Gin Gly Ser Asp Ala Pro Ala Arg Asp Phe Asp 

1715 1720 1725 

Gly Glu Asp Ser Asp Ser Pro Arg His Ser Thr Ala Ser Asn Ser 

1730 1735 1740 

Ser Asn Leu Ser Ser Pro Pro Ser Pro Val Ser Pro Arg Lys Thr 

1745 1750 1755 

Lys Ser Leu Ser Leu Glu Ser Thr Asp Arg Gly Ser Trp Asp Pro 

1760 1765 1770 



<210> 20 

<211> 720 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 3566B82CD1 

<400> 20 

Met Ala Ala Asp Pro Thr Glu Leu Arg Leu Gly Ser Leu Pro Val 
15 10 15 

Phe Thr Arg Asp Asp Phe Glu Gly Asp Trp Arg Leu Val Ala Ser 
20 25 30 

Gly Gly Phe Ser Gin Val Phe Gin Ala Arg His Arg Arg Trp Arg 
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35 40 45 





Glu 


Tyr 


Ala 


lie 


Lys 


Cys 


Ala Pro 


Cys Leu 


Pro 


Pro Asp Ala 










50 








55 




60 


Ala 


Arg 


Thr 


Phe 


Ala 


Ala 


Ser 


Val Ser 


Pro Leu 


Pro 


Ser He Tyr 










65 








70 




75 


Leu 


Ala 


Lys 


lie 


Ser 


Asp 


Phe 


Gly Leu 


Ser Lys 


Trp 


Met Glu Gin 










80 








85 




90 


Ser 


Thr 


Arg 


Met 


Gin 


Tyr 


lie 


Glu Arg 


Ser Ala 


Leu 


Arg Gly Met 










95 








100 




105 


Leu 


Ser 


Tyx 


lie 


Pro 


Pro 


Glu 


Met Phe 


Leu Glu 


Ser 


Asn Lys Ala 










110 








115 




120 


Pro 


Gly 


Pro 


Lys 


Tyr 


Asp 


Val 


Tyr Ser 


Pro Pro 


Thr 


Leu Pro Pro 










125 








130 




135 


Arg 


Ala 


Gly 


Val 


lie 


Leu 


Asp 


Val Gin 


Leu Ser 


His 


Ser Glu Arg 










140 








145 




150 


Val 


Leu 


Cys 


lie 


His 


Ser 


Phe 


Ala lie 


Val lie 


Trp 


Glu Leu Leu 










155 








160 




165 


Thr 


Gin 


Lys 


Lys 


Pro 


Tyr 


Ser 


Glu Leu 


Thr Ser 


Gin 


Leu Lys Glu 










170 








175 




180 


Arg 


Lys 


Gly 


Phe 


Asn 


Met 


Met 


Met lie 


He He 


Arg 


Val Thr Ala 










185 








190 




195 


Gly 


Met 


Arg 


Pro 


Ser 


Leu 


Gin 


Pro Val 


Ser Asp 


Gin 


Trp Pro Ser 










200 








205 




210 


Glu 


Ala 


Gin 


Gin 


Met 


Val 


Asp 


Leu Met 


Lys Arg 


Cys 


Trp Asp Gin 










215 








220 




225 


Asp 


Pro 


Lys 


Lys 


Arg 


Pro 


Cys 


Phe Leu 


Asp He 


Thr 


He Glu Thr 










230 








235 




240 


Asp 


lie 


Leu 


Leu 


Ser 


Leu 


Leu 


Gin Ser 


Arg Val 


Ala 


Val Pro Glu 










245 








250 




255 


Ser 


Lys 


Ala 


Leu 


Ala 


Arg 


Lys 


Val Ser 


Cys Lys 


Leu 


Ser Leu Arg 










260 








265 




270 


Gin 


Pro 


Gly 


Glu 


Val 


Asn 


Glu 


Asp lie 


Ser Gin 


Glu 


Leu Met Asp 










275 








280 




285 


Ser 


Asp 


Ser 


Gly 


Asn 


Tyr 


Leu 


Lys Arg 


Ala Leu 


Gin 


Leu Ser Asp 










290 








295 




300 


Arg 


Lys 


Asn 


Leu 


Val 


Pro 


Arg 


Asp Glu 


Glu Leu 


Cys 


He Tyr Glu 










305 








310 




315 


Asn 


Lys 


Val 


Thr 


Pro 


Leu 


His 


Phe Leu 


Val Ala 


Gin 


Gly Ser Val 










320 








325 




330 


Glu 


Gin 


Val 


Arg 


Leu 


Leu 


Leu 


Ala His 


Glu Val 


Asp 


Val Asp Cys 










335 








340 




345 


Gin 


Thr 


Ala 


Ser 


Gly 


Tyr 


Thr 


Pro Leu 


Leu He 


Ala 


Ala Gin Asp 










350 








355 




360 


Gin 


Gin 


Pro 


Asp 


Leu 


cys 


Ala 


Leu Leu 


Leu Ala 


His 


Gly Ala Asp 










365 








370 




375 


Ala 


Asn 


Arg 


Val 


Asp 


Glu 


Asp 


Gly Trp 


Ala Pro 


Leu 


His Phe Ala 










380 








385 




390 


Ala 


Gin 


Asn 


Gly 


Asp 


Asp 


Gly 


Thr Ala 


Arg Leu 


Leu 


Leu Asp His 










395 








400 




405 


Gly 


Ala 


Cys 


Val 


Asp 


Ala 


Gin 


Glu Arg 


Glu Gly 


Trp 


Thr Pro Leu 










410 








415 




420 


His 


Leu 


Ala 


Ala 


Gin 


Asn 


Asn 


Phe Glu 


Asn Val 


Ala 


Arg Leu Leu 










425 








430 




435 


Val 


Ser 


Arg 


Gin 


Ala 


Asp 


Pro 


Asn Leu 


His Glu 


Ala 


Glu Gly Lys 










440 








445 




450 


Thr 


Pro 


Leu 


His 


Val 


Ala 


Ala 


Tyr Phe 


Gly His 


Val 


Ser Leu Val 










455 








460 




465 


Lye 


Leu 


Leu 


Thr 


Ser 


Gin 


Gly 


Ala Glu 


Leu Asp 


Ala 


Gin Gin Arg 










470 








475 




480 


Asn 


Leu 


Arg 


Thr 


Pro 


Leu 


His 


Leu Ala 


Val Glu 


Arg 


Gly Lys Val 










485 








490 




495 


Arg 


Ala 


lie 


Gin 


His 


Leu 


Leu 


Lys Ser 


Gly Ala 


Val 


Pro Asp Ala 










500 








505 




510 
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Asp 


Gin 


Sex* Gly 




PI \r 


IrXO XibU 


UXS 


mv,.-. Ala 

xnr AXci 


A 1 =a 

i\ia 


Axa 


Arg 








515 




















liys 




Ij6U. X 1 6 




Xiys 


UTq^ X ^^^^ 


Leu 


Arg Tyr 


v^xy 


^1 a 

Axa 


ber 








•Jo? \J 








D JD 








b4u 


LiGU 


Glu 




Pro Tlur 


Hi s 

XIX D 




v»xy irp 


Thr 


Pro Leu 


UXS 


Leu 


Ala 
























c e c 

555 




TVT* 






UCU 




Tl A Tl A 

xxe xxe 


JaXS 


Leu Leu 


Axa 


V9XU 


Ser 
























570 


Hi A 




Ann 








^1 «r K 1 a 

laxy Axa 


vax 


Asn Trp 


Thr 


Pro 


Leu 
















con 








585 




XJCU 


Al a 


Ala Axg 


IlXS 


\»xy 


*^i oi 

IvXU GXU 


X 1 ^ 

Ala 


Val Val 


Ser 


Ik 1 

AXa 


Leu 
















c n cr 

595 








600 


XiGU 






oxy /ua 


Asp 


Firo 


Asn Ala 


Ik 1 » 
AXa 


Glu Gin 


Ser 


Gly 


Trp 
















610 








615 






li6U 


His IiGU 


1^ 1 B 

AJ.a 


val 


Gin Arg 


Ser 


Thr Phe 


Leu 


Ser 


Val 








con 








625 








630 


Tl A 


Asn 


IjGU 


XiGU Glu 


HIS 


HXS 


Ala Asn 


Val 


His Ala 


Arg 


Asn 


Lys 








bob 








640 






645 


VcLx 




Trp 


Thic Pro 


Aj.a 


Hxs 


Leu Ala 


Ala 


Leu Lys 


Gly 


Asn 


Thr 








ceo 
bOU 








655 








660 




J.X6 


XiGU 


liys vai 


Leu 


vax 


Glu Ala 


Gly 


Ala Gin 


Leu 


Asp 


Val 








ODD 








D /O 






675 


Gin 


Asp 


Gly 


Val Ser 


Cys 


Thr 


Pro Leu 


Gin 


Leu Ala 


Leu 


Arg 


Ser 








680 








685 






690 


Arg 


Iiys 


Gin 


Gly lie 


Met 


Ser 


Phe Leu 


Glu 


Gly Lys 


Glu 


Pro 


Ser 








695 








700 








705 


Val 


Ala 


Thr 


lieu Gly 


Gly 


Ser 


Lys Pro 


Gly 


Ala Glu 


Met 


Glu 


lie 








710 








715 








720 



<210> 21 

<211> 5200 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 4615110CB1 

<400> 21 

cgtggctgag ccagcagctg cagcagctac 
ccgccatgga ggccgtgccc cgcatgccca 
actttcactt ccagccagct gtgaagaagt 
aagcctacaa tgaagaactg aagaagctgg 
cacgagactt tgagggctgt agtgtcctcc 
agagtcgggt ccccatgggc tcgggccagg 
tcttctcagg caagtctgtg gcccatgagg 
acaaccttgg agcgctgcac tccatgctgg 
gcatgaaggt ctcctgtacc catttccagt 
agcacttccc tcaagcctac agcgtcgaca 
acctcatgct gggccaggct caggagtgcc 
agagctttct ggtggcccgc atcagtgcac 
gggccttgga gaaccccgac actgcctcac 
aacttgtgca gatgaagatc tactacttcg 
aggccgagga gcagcagaag ttcggggagc 
agctcaatga agccatcaag ttggccaagg 
gcttcactat ggatgtcatt gggggaaagt 
tttaccatga ggctgtccca gcattggaca 
tgaagccctt gccagtgaac cccacagacc 
aactggtacc catggctgcc cacgaggcct 
tgctccggga gatgatggcc aagattgagg 
attcaatgca gttggatccc gagacggtgg 
cccagctcat ggagaagtgc gcggctctca 



gggagtggcc gggtggccgg cgggtgccag 60 
tgatctggct ggacctgaag gaggccggtg 120 
ttgtcctgaa gaattatgga gagaacccag 180 
agttgctcag acagaatgct gtccgtgtcc 240 
gcaagtacct cggccagctt cattacctgc 300 
aggccgctgt ccctgtcacc tggacagaga 360 
acatcaagta cgagcaggcc tgtattctct 420 
gggccatgga caagcgggtg tctgaggagg 480 
gcgcagccgg cgccttcgcc tacctacggg 540 
tgagccgcca gatccttacg ctcaacgtca 600 
tcctggagaa gtcgatgttg gacaacagga 660 
aggtggtaga ttactacaag gaggcatgcc 720 
tgctgggccg gatccagaag gactggaaga 780 
cagccgtggc tcatctgcac atgggaaagc 840 
gggttgcata cttccagagc gccctggaca 900 
gccagcctga cactgtgcaa gacgcgcttc 960 
acaattctgc caagaaggac aacgacttca 1020 
cccttcagcc tgtaaaagga gcccccttgg 1080 
cagctgttac aggccctgac atctttgcca 1140 
cgtcactgta cagtgaggag aaggccaagc 1200 
acaagaatga ggtcctggac cagttcatgg 1260 
acaaccttga tgcctacagc cacatcccac 1320 
gcgtccggcc cgacactgtc aggaaccttg 1380 
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tacagtccat gcaagtgctg tcaggtgtgt tcacggatgt ggaggcttcc ctgaaggaca 1440 
tcagagatct gttggaggag gatgagctgc tagagcagaa gtttcaggag gcggtgggcc 1500 
aggcaggggc catctccatc acctccaagg ctgagctggc agaggtgagg cgagaatggg 1560 
ccaagtacat ggaagtccat gagaaggcct ccttcaccaa cagtgagctg caccgtgcca 1620 
tgaacctgca cgtcggcaac ctgcgcctgc tcagcgggcc gcttgaccag gtccgggctg 1680 
ccctgcccac accggccctc tccccagagg acaaggccgt gctgcaaaac ctaaagcgca 1740 
tcctggctaa ggtgcaggag atgcgggacc agcgcgtgtc cctggagcag cagctgcgtg 1800 
agcttatcca gaaagatgac atcactgcct cgctggtcac cacagaccac tcagagatga 1860 
agaagttgtt cgaggagcag ctgaaaaagt atgaccagct gaaggtgrtac ctggagcaga 1920 
acctggccgc ccaggaccgt gtcctctgtg cactgacaga ggccaacgtg cagtacgcag 1980 
ccgtgcggcg ggtactcagc gacttggacc aaaagtggaa ctccacgctg cagaccctgg 2040 
tggcctcgta tgaagcctat gaggacctga tgaagaagtc gcaggagggc agggacttct 2100 
acgcagatct ggagagcaag gtggctgctc tgctggagcg cacgcagtcc acctgccagg 2160 
cccgcgaggc tgcccgccag cagctcctgg acagggagct gaagaagaag ccgccgccac 2220 
ggcccacagc cccaaagccg ctgctgcccc gcagggagga gagtgaggca gtggaagcag 2280 
gagacccccc tgaggagctg cgcagcctcc cccctgacat ggtggctggc ccacgactgc 2340 
ctgacacctt cctgggaagt gccaccccgc tccactttcc tcccagcccc ttccccagct 2400 
ccacaggccc aggaccccac tatctctcag gccccttgcc ccctggtacc tactcgggcc 2460 
ccacccagct gatacagccc agggccccag ggccccatgc aatgcccgta gcacctgggc 2520 
ctgccctcta cccagcccct gcctacacac cggagctggg ccttgtgccc cgatcctccc 2580 
cacagcatgg cgtggtgagc agtccctatg tgggggtagg gccggcccca ccagttgcag 2640 
gtctcccctc ggccccacct cctcaattct caggccccga crttggccatg gcggttcggc 2700 
cagccaccac cacagtagat agcatccagg cgcccatccc cagccacaca gccccacggc 2760 
caaaccccac ccctgctcct cccccgccct gcttccctgt gcccccaccg cagccactgc 2820 
ccacgcctta cacctaccct gcaggggcta agcaacccat cccagcacag caccacttct 2880 
cttctgggat ccccacaggt tttccagccc caaggattgg gccccagccc cagccccatc 2940 
ctcagcccca tccttcacaa gcgtttgggc ctcagccccc acagcagccc cttccactcc 3000 
agcatccaca tctcttccca ccccaggccc caggactcct acccccacaa tccccctacc 3060 
cctatgcccc tcagcctggg gtcctggggc agccgccacc ccccctacac acccagctct 3120 
acccaggtcc cgctcaagac cctctgccag cccactcagg ggctctgcct ttccccagcc 3180 
ctgggccccc tcagcctccc catcccccac tggcatat:gg tcctgcccct tctaccagac 3240 
ccatgggccc ccaggcagcc cctcttacca ttcgagggcc ctcgtctgct ggccagtcca 3300 
cccctagtcc ccacctggtg ccttcacctg ccccatctcc agggcctggt ccggtacccc 3360 
ctcgcccccc agcagcagaa ccaccccctt gcctgcgccg aggcgccgca gctgcagacc 3420 
tgctctcctc cagcccggag agccagcatg gcggcactca gtctcctggg ggtgggcagc 3480 
ccctgctgca gcccaccaag gtggatgcag ctgagggtcg tcggccgcag gccctgcggc 3540 
tgattgagcg ggacccctat gagcatcctg agaggctgcg gcagttgcag caggagctgg 3600 
aggcctttcg gggtcagctg ggggatgtgg gagctctgga cactgtctgg cgagagctgc 3660 
aagatgcgca ggaacatgat gcccgaggcc gttccatcgc cattgcccgc tgctactcac 3720 
tgaagaaccg gcaccaggat gtcatgccct atgacagtaa ccgtgtggtg ctgcgctcag 3780 
gcaaggatga ctacatcaat gccagctgcg tggaggggct ctccccatac tgccccccgc 3840 
tagtggcaac ccaggcccca ctgcctggca cagctgctga cttctggctc atggtccatg 3900 
agcagaaagt gtcagtcatt gtcatgctgg tttctgaggc tgagatggag aagcaaaaag 3960 
tggcacgcta cttccccacc gagaggggcc agcccatggt gcacggtgcc ctgagcctgg 4020 
cattgagcag cgtcogcagc accgaaaccc atgtggagcg cgtgctgagc ctgcagttcc 4080 
gagaccagag cctcaagcgc tctcttgtgc acctgcactt ccccacttgg cctgagttag 4140 
gcctgcccga cagccccagc aacttgctgc gcttcatcca ggaggtgcac gcacattacc 4200 
tgcatcagcg gccgctgcac acgcccatca ttgtgcactg cagctctggt gtgggccgca 4260 
cgggagcctt tgcactgctc tatgcagctg tgcaggaggt ggaggctggg aacggaatcc 4320 
ctgagctgcc tcagctggtg cggcgcatgc ggcagcagag aaagcacatg ctgcaggaga 4380 
agctgcacct caggttctgc tatgaggcag tggtgagaca cgtggagcag gtcctgcagc 4440 
gccatggtgt gcctcctcca tgcaaaccct tggccagtgc aagcatcagc cagaagaacc 4500 
accttcctca ggactcccag gacctggtcc tcggtgggga tgtgcccatc agctccatcc 4560 
aggccaccat tgccaagctc agcattcggc ctcctggggg gttggagtcc ccggttgcca 4620 
gcttgccagg ccctgcagag cccccaggcc tcccgccagc cagcctccca gagtctaccc 4680 
caatcccatc ttcctcccca cccccccttt cctccccact acctgaggct ccccagccta 4740 
aggaggagcc gccagtgcct gaagccccca gctcggggcc cccctcctcc tccctggaat 4800 
tgctggcctc cttgacccca gaggccttct ccctggacag ctccctgcgg ggcaaacagc 4860 
ggatgagcaa gcataacttt ctgcaggccc ataacgggca agggctgcgg gccacccggc 4920 
cctctgacga ccccctcagc cttctggatc cactctggac actcaacaag acctgaacag 4980 
gttttgccta cctggtcctt acactacatc atcatcatct catgcccacc tgcccacacc 5040 
cagcagagct tctcagtggg cacagtctct tactcccatt tctgctgcct ttggccctgc 5100 
ctggcccagc ctgcsacccct gtggggtgga aatgtactgc aggctctggg tcaggttctg 5160 
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ctcctttatg ggacccgaca tttttcagct ctttgctatt 5200 

<21D> 22 

<211> 4330 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4622229CB1 

<400> 22 

ctgaggcggg cgggcggtat agagcgggcg gcaggaggca agcagcgaaa ccttcccggc 60 
cgccgctccc gtcccgacgg cggcttcccc aaggcggcag gactcggcgc gccatggaca 120 
ggccggcggc ggcggcggcg gcgggctgcg agggcggcgg gggcccgaac ccggggccgg 180 
cgggcggcag gaggcctcct cgggccgcgg ggggcgccac cgccggctcc cggcagccca 240 
gcgtggagac cctggacagt cccacaggat cacatgttga atggtgtaaa cagcttatag 300 
ctgctacaat ttctagtcag atttcaggtt cagtgacatc agaaaatgtg tccagagatt 360 
acaaggtttt caggaggcct gatctaaggg ctctaaggga tggaaataag ctggcacaga 420 
tggaagaggc tccacttttc ccaggagaat caattaaagc cattgtgaaa gatgtcatgt 480 
atatctgccc atttatggga gcagtgagtg gaaccctgac agtgacggac tttaagctgt 540 
acttcaaaaa tgtcgagagg gacccgcatt ttatccttga tgttcccctt ggagtgatca 600 
gcagagtgga gaagattgga gcacagagcc atggagacaa ttcctgtggt atagagatag 660 
tgtgcaagga tatgaggaac ttgcggcttg cttataaaca ggaagaacag agtaaactag 720 
ggatatttga aaacctcaac aaacatgcat ttcctctttc taacggacag gcactatttg 780 
cattcagcta taaagaaaaa tttccaatta atggctggaa agtttatgat ccagtatctg 840 
aatataagag acagggcttg ccaaatgaga gttggaaaat atccaaaata aacagtaatt 900 
atgagttctg tgacacctac cctgccatca ttgttgtgcc aactagtgta aaagatgatg 960 
acctttcaaa agtggcagct tttcgagcaa aaggcagagt ccctgtgttg tcatggattc 1020 
atccggaaag tcaagcaacg attacccgtt gcagccagcc acttgtgggt cccaatgata 1080 
agcgctgcaa agaggatgaa aaatacttgc aaacaataat ggatgctaac gcacagtcac 1140 
acaagcttat catctttgat gctcgacaaa acagtgtcgc tgataccaac aagacaaagg 1200 
gtggaggata tgaaagtgaa agtgcttacc caaatgcaga acttgtgttc ttggagatcc 1260 
acaacattca tgtcatgcga gagtcactac gcaaattaaa agagattgtg tacccttcga 1320 
tcgatgaggc gcggtggctc tccaatgtgg atgggacgca ttggctggaa tatataagga 1380 
tgctgcttgc tggggcagta agaattgctg ataaaataga atctgggaaa acatctgtgg 1440 
tggtgcattg cagcgacggt tgggaccgaa cagcccagct cacatctctg gctatgctaa 1500 
tgttggacag ttactacagg accattaaag gatttgaaac tctcgtagaa aaggagtgga 1560 
taagctttgg acacaggttt gcactgcgag tgggccatgg taatgacaac catgcggatg 1620 
ctgaccgatc tcccatattt ctgcagtttg ttgattgtgt ttggcaaatg acaaggcagt 1680 
ttccttcagc attcgagttt aatgagctat tcttgattac aattttggat cacctttata 1740 
gctgtctttt tgggaccttt ttgtgcaact gtgaacagca gcgattcaaa gaggatgtat 1800 
atacaaagac gatatcttta tggtcgtata tcaatagcca gctagacgag ttttctaatc 1860 
ccttctttgt gaattatgaa aaccacgtgt tatatcctgt tgctagtctg agtcatttgg 1920 
aattgtgggt aaattattat gtacgatgga atccacggat gagacctcag atgcccattc 1980 
accagaatct caaggagctg ctggccgtca gggcggagct gcagaagcgt gtggagggtc 2040 
tacagcggga ggtggccacg cgcgccgtct catcctcatc tgagcggggc tcctcgccct 2100 
cccactccgc cacctccgtc cacacctcgg tctgatgggc gaggtcagcc tgctgctcca 2160 
ctgtctcccg gtggctcagg aaagggacct ggcgatcact gttatggctg tagcttgtga 2220 
tcttgtcttt taggattagg cccagggacc atttgtgtgg ctaggtgaca gctcccactg 2280 
ttggcaaccg ttaccctcct gtcagcggtt tcacagggga gccgtctgtc acgcccaccc 2340 
tgtgaagcaa cttctggcat tcaggcagct tgggagaaac taagtgaacg gaatgcagta 2400 
ctgaggttca agaaagctgt acgccatttc tttccaactt aaatccttca gtaacaacaa 2460 
acaccactca cttcaagatg cattgccagc cccgtggctt ccctcagctc ttggccacaa 2520 
cttgaaaact gtcttgaatg aagtacttgg ggagaagaca ggccactgcc ctctgttcca 2580 
cagttttctt catgcacggg gtcctcctgt taacaattac tgttgtgtac atataaggta 2640 
tttttagaga agagaaacag gcctttattt tcctatgtcc ttttttacgt ttagaatagt 2700 
cacccgaggg gggatcagct caactgtact gtgggagaaa ttcttttcca acaaacctca 2760 
tgctcgtttt tctgtggtgc aatttcaagg gcaacgtgtt ctgtcctcac ctcactctgg 2820 
tactcgcctc ttggggcggc tcagcccatt catggggatg gcaccaagcg gccatgctca 2880 
gtcttccagc cccgctgagg gtaaaccgag gcctctggca gctgtgcaca ggtgctggcc 2940 
tctggctcct tcaaggagca ctgcctgtca ctcgctcctg ggctgtctag ccatgtctcc 3000 
cacccccact ttaccgcagc cagctgctgg gatcaaagca agtctgttct tatgttattt 3060 
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gcctgtatga aatcatttct cattttatca caattccttc aactcagctt actcgcgtgg 3120 
ctgcctgttc atatttgaaa gcagccaccg tgctgtggct ttggtttgga aaagcatagc 3180 
acgcacttcc cttggttttc ccttcccaga gccgaccgca gctggtcagc cctctcttcc 3240 
cgctcctgaa cctttactta ctgactttga gctctgtgac tccgtcggtt ctcgcaggaa 3300 
ttaactaact taccaattgg ttcaatccac ttgagcgcca tagctctgag ctcctctgtg 3360 
tgacatgcca cagatgacta ttgcacacct gggtcctgcc ccagcaggcc atgcccctcc 3420 
catgtgccgt gcctgttgct gcagctgccc ccccaccccg ccactggctg caggaattca 3480 
gcctttagag gcagaggcag ctgcagcggc ccctgaggtc aaaccccagt gtgactgcat 3540 
agcagtgtta gctggttggt ttcaaactac tggattccag gcaaaggcct acagattgac 3600 
cttattattt ttgaaaatat gttaagggtt ttttcataga gagagaaaga atggattttt 3660 
ttttaactgg gaacctcctg attcttacgg aaaattatcc ttctataaga agataccaga 3720 
gagatttatt caaggtaatt tgataaccta aaatcaattc tccatttttt atcatatgtg 3780 
ggatttgttg ctaagtcgtg ttcaacaata gcttttatgt tcctaacata tctgaaagct 3840 
tatttatgaa tggatatact ggattattga tatactgatt ttttttttaa tggggacatt 3900 
tgccattttc ttcccagaaa tatgtaatcc cctggctgac taggactgtt aaacatagtg 3960 
tggactggac gatgccttcg acaaaccaga gaaaccaagt tggggggagc tggrtgcctgg 4020 
agtgggccct gtgcacctca cctggcggag gctggggggg ctctgtcagc aggaccctag 4080 
aggagactct cattcgattt taaagaagca caacgggtca ttttcctttg tatgttccta 4140 
gcgcagaact gtttctaaaa caacttgaag tatagttttg ttatctaagc aatttttgtt 4200 
ttaagtaagt aagtgtacta gaatgcgaag ccgttatggt tcaggttttt aaaaactggt 4260 
acagtattgt atttgtctca tctgttgcac tgtatttcaa tcatctgtaa ttaaaatgat 4320 
catatgttta 4330 

<210> 23 

<211> 2851 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 72358203CB1 

<400> 23 

atcttgaatg cagggacagc catttggaag actgtcctcg agggtggcag cagcctctca 60 
ggacggggaa agccaaggtg cccactaagc ccgtctgggg tggagggtgg caggccgggg 120 
tggagaggat tggaggccgc ctgaaggaac ctgtgctcgg tggcatttac tcaatgtggg 180 
ggtctgacac ctcccagctc attttgtccc catcttcccc ttgcgtcagg tccccaccca 240 
gcaggaggag gtccgaggat ctgcgcgacg ctggccccgc ggagtgtggg ggacttttcc 300 
tctcaaccac cagtgccccg caagcgtggg tgaacagtcc tccctggcta ctgtgggacg 360 
ctgggcaggc gacttcgcct ctctaggcct cggttttcct gcttgtcaaa tggggctgat 420 
gccggcgtgg gcttcttcag gcggtcgcga gcgttgaccc ctggagtcag cgaaccagcc 480 
gcgcacgcac tcccgggcgg aggtcggggc tggggggcga cgcctcccgt ctgcgcgccc 540 
ccggccccgc ctcccgccgg cgcacccctc cctcggctcc gcccgcggcc cgcttcttcc 600 
tcccgcgggc ggcccagccc tagcgccccg cgctccgcgg gcagccccct gccgccgcgc 660 
catgtccgcc ggctggttcc ggcgccgctt cctgcctggg gagccgctcc ccgcgccgcg 720 
gccgcctggg ccgcatgcca gccccgtgcc ctaccgacgg ccccgcttcc ttcgcggctc 780 
cagctccagc cccggggcgg ccgacgcctc gcgccgccca gactcccggc ccgtgcgcag 840 
ccccgcacga ggacgcacgc taccctggeia tgcaggctac gccgagatta tcaatgcaga 900 
gaaatctgaa ttcaatgagg atcaagccgc ctgtgggaag ctgtgcatcc ggagatgtga 960 
gtttggggct gaagaagagt ggctgaccct gtgcccagag gagttcctga caggccatta 1020 
ctgggcactg ttcgatgggc acggcggtcc tgcagcagcc atcttggctg ccaacaccct 1080 
gcactcctgc ttgcgccggc agctggaggc cgtggtggaa ggcttggtgg ccactcagcc 1140 
ccccatgcac ctcaatggcc gctgcatctg ccccagtgac cctcagtttg tggaggaaaa 1200 
gggcatcagg gcagaagact tggtgatcgg ggcattggag agtgcctttc aggaatgtga 1260 
tgaggtgatc gggcgggagc tggaggcctc aggccagatg ggcggctgca cagccctggt 1320 
ggctgtgtcc ctgcagggaa agctgtacat ggccaatgct ggggatagca gggccatctt 1380 
99t:gcggaga gatgagatac ggccactgag cttcgagttc accccagaga ctgagcggca 1440 
gcggatccag cagctggcct ttgtctatcc tgagcttctg gctggtgagt tcacccgact 1500 
ggagttccct cggcggctga agggggatga cttgggacag aaggttttgt tcagggatca 1560 
ccacatgagt ggctggagct acaaacgtgt ggagaaatcg gatctcaagt acccactgat 1620 
ccatggacag ggtaggcagg ctcggttact aggaacactg gctgtctccc ggggcctggg 1680 
agaccatcag ctcagagtcc tggacacaaa catccagctc aagcccttct tgctctctgt 1740 
gccacaggtg actgtgctgg atgtggacca gctggagcta caggaggatg atgtggttgt 1800 
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catggcaact gatggactct gggatgtact gtccaacgag caggtggcat ggctggtgcg 1860 
gagcttcctc cctgggaacc aagaggaccc acacaggttc tcaaagctgg cccagatgct 1920 
gatacacagc acacagggaa aggaagacag tctcacagag gaagggcagg tgtcctacga 1980 
tgacgtctct gtgttcgtga ttcccttgca cagtcagggc caagagagca gtgaccactg 2040 
aggattcaga cactgtatcc cagaactgct ctagtgcccg ggtgtggtct gggcatccct 2100 
ccagtgtgac caagagcaaa tcctgcctgc cctatcccta gccaccgccc agtgctctca 2160 
ctatccacct caacacacat ccatctcaag aggaacattt ataccaggca gtcagagctg 2220 
gaagtgtat'g gagagcccag cccaccaggt cctgcctttt gcggtgataa ccttctctgg 2280 
cagagtgact ttacaactta actaggaaac ccatgtgagg ctcctcagac aggatcttga 2340 
acagcccaaa gtatcattct cagatagggg cacccaagct aagggtatta gccaaagatg 2400 
ccaggatggg tagctagccc atgtttagat ccaggtctcc aattcatggt tatcagggca 2460 
tgtgttcaac aacccccaaa gtccacgcag gtggcttgta gaaacctttg ggcagcctca 2520 
tgtctgctaa aacagccatc ttcaagacag cccctgaaaa gagaccagtt caggtcctgc 2580 
cctgctgttc tttgctggag atgaggaaca ggtgctgggg ctaaagtttg gggtagagca 2640 
caagggacaa gaggaactct tggagttggc tgggtgagag ggctctccat ttgctacctg 2700 
tagtagcctg cctcttaact ggttgcttct ccctagttcc agccctgccc tggtctgatg 2760 
ccccaacact gcccttgctt tgttttccct gtcacctccc tattattaaa tgttttctac 2820 
agaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 2851 

<210> 24 

<211> 2361 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 4885040CB1 

<400> 24 

ggctcctacc agccattgta ggccaataat ccgttatgga gcatgccttt accccgttgg 60 
agcccctgct ttccactggg aatttgaagt actgccttgt aattcttaat cagcctttgg 120 
acaactattt tcgtcatctt tggaacaaag ctcttttaag agcctgtgcc gatggaggtg 180 
ccaaccgctt atatgatatc accgaaggag agagagaaag ctttttgcct gaattcatca 240 
atggagactt tgattctatt aggcctgaag tcagagaata ctatgctact aagggatgtg 300 
agctcatttc aactcctgat caagaccaca ctgactttac taagtgcctt aaaatgctcc 360 
aaaagaagat agaagaaaaa gacttaaagg ttgatgtgat cgtgacactg ggaggccttg 420 
ctgggcgttt tgaccagatt atggcatctg tgaatacctt gttccaagcg actcacatca 480 
ctccttttcc aattataata atccaagagg aatcgctgat ctacctgctc caaccaggaa 540 
agcacaggtt gcatgtagac actggaatgg agggtgattg gtgtggcctt attcctgttg 600 
gacagccttg tatgcaggtt acaaccacag gcctcaagtg gaacctcaca aatgatgtgc 660 
ttgcttttgg aacattggtc agtacttcca atacctacga cgggtctggt gttgtgactg 720 
tggaaactga ccacccactc ctctggacca tggccatcaa aagctaacct gttgactggc 780 
atccataagt gtgcctctgc cttatctcat ttctcaacag ttcattgctc aacaagaacg 840 
attcacctgg gtttgcaaga atctaaacct ctctagggga agcccactgg gtttaaagat 900 
gttagtgttt agataataca ggtaacatta taaatgacag atctcaattt tatagtagtg 960 
ggaaagatac atgctaagaa agcaaataag ctctattata ttcggttgga acctaatggg 1020 
aatcattcca ctatacaatt cagtactgat tattcttctt acattattaa tcattccatt 1080 
tatcctagaa aattgttttt aatttgaatc agagaaaact gttgaggttc ctcttggagt 1140 
ctagaacatc cttaaatgtc taacaacaag ggctacctct gagtaccttt tagtattagt 1200 
tttctgtata tgatatatat tatcttatac tgaaaaaaaa ttcctttcag attggggtgt 1260 
tagaagtgca ccaggtcact ctgaccttat tactgtcttt ggtattgtct taaataaatc 1320 
aagaatcatt gacctaattg ttaaatttaa aaataggtag ttagcaatag gtggaaagag 1380 
aaatgatgtg aaagataaat gatgattcgt ggagccctac tcacacatta acccccaaat 1440 
tcaaaagtaa gaatgcaaaa gtctagaggg ggtaacagtc tgcatcatca tcacaaccta 1500 
aatggagaaa gctgtgcaga ggaaacttaa gcataaaaat tgaattcgtt tctgacatac 1560 
cttagactga aaaactgttg gttcatccag aagtgtattc atattaccag aaaatgagtt 1620 
tgtctatggg gatacatgaa cttcatatac taaggagcct aactccaaag cctgcgttct 1680 
catcccagtc tgatattcac ctaagtttcc ggaccctttt ccttagctgt aaaatggaag 1740 
cggttggact gatggtgtct gaggttcttt cccacactga aattctaaat attgacactt 1800 
agcagtcata gggctgataa tacacacagt tactgactta gcctaaacaa cctggtgcat 1860 
cgaaatgtat tcacctttct tttgtaaaga gaccatcttc tatcttcttt ccacctttct 1920 
ctgttttatg aaaccaactg ttgacataca aaccatgatt gaaggagaac ctgtccaaca 1980 
tgttttatgt acacaaatcc ctatgttgct ataagaaaag tgaaagtaac tgttttcttc 2040 
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ttggtgctat gacagtgtga gactcaggtt 
gcgtgattgt ggcatttaag gagcagtggc 
cattactttc tgcttgaccg gaagttgagg 
agactagtta tatattcctt actagaaata 
tgattttaaa attttgctac caaagaatgc 
tgttaataaa tggatactta t 

<210> 25 

<211> 2285 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^featiire 

<223> Incyte ID No: 7484507CB1 



gtctgtagag aatgaaagga gcagtggccc 2100 
ccatgtgact gtggcatttt cggcactttt 2160 
cttagctatg tttccatctt cagtttctga 2220 
tattcataat atataaaaga aatatatctg 2280 
atgttctgtg tgccctgaaa atgttaccag 2340 

2361 



<400> 25 

gcggtccctc ccggcccggc ggaacgcgtc 
ggggccagcg cgcgggaggg acgcctgagt 
tcccgcgggt cccactgacc cacgcggggt 
ggtcgctact gatcatgctt gggccagggt 
gaggcccagg gtcccccgga gagcccatgg 
ctggggcctt gggggtgaac ctggtggtgg 
taaagcaggt ggaatgcatg gatgaccatt 
cactgctgaa gctgcggcac gcccacatct 
atggggagat ctcttctctg tacctctgcc 
aggaggtcat tgaggataag aggaaggcaa 
atgtgctggg ccaggtgctg gacgcgctgg 
ggaatctcaa accctccaac atcatcctca 
tgagttccaa tgtgctaatg acagacaaag 
cctttcgtaa gtcctggatg gcccctgaag 
acatctggtc cctgggctgc atcattctgg 
cageiagccat gcatctgcgg aagtccctcc 
tgaagacaat ggaggagaag cagatcccgg 
tgatgctcca gatcgacccc tcggatcgaa 
tcttgagagg ctccttcaag tcctcgtgcg 
ctgcgtccat caccgacatg ctgttagaag 
gggacacaaa gggggagcgt gccctgaagc 
tagttccaga gggttcatta tttatgcccc 
tcagctgtga ccaggacaga gtccctggga 
ggaagctgtt gggccccatc ccaaagggtc 
tggtcacgac catggagcta catgacaggg 
tgctgctgca cctcctgggc caaggtatca 
cggacctcat cagccaggtg ttggccacct 
gctgcggagt cttctggctg ctgtccctgc 
aagtggtggc gctgctcctg caaagcatcc 
acaatgccta ccggggactg gccagcctgg 
tggtggtgca ggaggagggc ggcagtggcc 
acagggacga cccggaggtg gtggagaacg 
atgaggagat cctgccggag ctggtgtcca 
aggagcgctt cacctccagc ctggaactgg 
tggaggcagc cacctctccc agcccactgg 
gagaagacag ataccccaca ggcccctccc 
tccatgggcc actgtttccc ttggggtggg 
cctcacactc ctcttaggtg actaataaag 
aaaaa 

<210> 26 

<211> 4858 

<212> DNA 

<213> Homo sapiens 



ccttttaagg gggcggggac ctgggggtct 60 
gcctcgaggg cgccgttcgg gcggggagga 120 
ggggccaggg gtggacgctc gcccgtacgc 180 
ccaatcgcag gcgccccacg cagggggagc 240 
agaagtacca ggttttgtac cagctgaatc 300 
aggaaatgga aaccaaagtc aagcatgtga 360 
acgccagtca ggccctggag gagctgatgc 420 
ctgtgtacca ggagctgttc atcacgtgga 480 
tggtgatgga gttcaatgag ctcagcttcc 540 
agaaaatcat tgactctgag tggatgcaga 600 
aatacctgca ccatttggac atcatccaca 660 
tcagcagtga ccactgcaaa ctgcaggacc 720 
ccaaatggaa tattcgtgcg gaggaagacc 780 
ccctcaactt ctccttcagc cagaaatcag 840 
acatgaccag ctgctccttc atggatggca 900 
gccagagccc aggcagcctg aaggccgtcc 960 
atgtggaaac cttcaggaat cttctgccct 1020 
taacgataaa ggacgtggtg cacatcacct 1080 
tctctctgac cctgcaccgg cagatggtgc 1140 
gcaacgtggc cagcatttta ggtgatgctg 1200 
tcctgtccat ggccttggca tcctattgtt 1260 
tggccttgct ccacatgcac gaccagtggc 1320 
agagagactt tgcctccctg gggaaactag 1380 
tgccgtggcc cccggagctg gtggaggtgg 1440 
tcctcgatgt ccagctgtgt gcctgctccc 1500 
ttgtgaacaa ggcccccttg gagaaggtcc 1560 
accctgcgga tggggaaatg gcagaagcca 1620 
tgggctgcat caaggagcag cagtttgaac 1680 
ggctgtgcca ggacagagcc ctgctggtga 1740 
tgaaggtgtc agagctggcg gccttcaagg 1800 
tcagcctcat caaggagacc taccagctcc 1860 
tgggcatgct gctggtccac ctggcttcct 1920 
gtagtatgaa ggccctgctc caggagatca 1980 
tttcttgcgc ggaaaaagtg ctcttgaggc 2040 
gtggggaagc agctcagccc tgatgcgggg 2100 
tccacgtgtg ccctctccct gtccttcctt 2160 
gggaagggtc atccagcacc agaatgcgca 2220 
aggcccaagg ccagtttctg ccttaaaaaa 2280 

2285 



<220> 
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<221> inisc_feature 

<223> Incyte ID No: 7198931CB1 

<400> 26 

atggcggcgg cggcggggaa tcgcgcctcg 
agccctgagg caggcggcgg cggaggagcc 
gcgggactgc tgcgggaggc gggcagcggg 
cagctgcgca aagtgcggag tgtggagctg 
gccgcctcac cgccggcctc ctcgacttcc 
agtgggaccg gcttccagcc tgtggcggtg 
ggcgcccacc ttaccgagtc ggtggcggcg 
gccgagcccg gggagaagcg ggcgcccgcc 
ggtcgtgaga tggagaataa agaaactctc 
gaggaacgaa tgatcaggga gaaactgaag 
tggttggaaa ggagaaatag gcgagggcct 
gatggatctg aaatgaatca cttagcagct 
gcttcaccag cttccaaagg ccgacgcagt 
acagtgaaat cagaatctcc aggagtaagg 
agtggcagaa tcacaccacc ccgaagagcc 
cctgaggaaa caaaccgccg tgttaacaaa 
cagatagggc ctaactcttt cctgattgga 
tttattgggc ctcagaactg cagctgtgca 
gtgatgctcc gggtgtttca actagaacct 
aagaattttg aggttgagag tttgttccag 
aaagctccat ctcgtaacac catccagaag 
ttgtcatcat ctagtacttc tacatctagt 
cagatgtgtc ctatttgctt gttgggcatg 
gacggctgca ggaacaagct gcaccaccac 
agaaatagag aacctttaat atgtcccctt 
tacagccacg agttgtcaag tcctgtggat 
caaaccgtac agcagcagcc tttggctgga 
cttactcatt atggaactca gcaaatccct 
attcaggtgt ttggaatgga actcgttggc 
gagatggccc tcaggcgtct ttcccatgat 
gagagcactg gaaattctgg gggcagcagt 
gggtcttccc agaccagtat ctcaggagat 
atggtctgtg ctgaccctgt ctacaaagtg 
atgctggtat atactccttg ccacagttta 
cagccagttg tagacaccat cctagtcaaa 
ctgtccatat caacactgtt ggaactgtgc 
agagaaatac t€iaaagctgg atccattggt 
attcttggaa accaaactga atcaaacaat 
atagatagac tgttgttgga atttcctgct 
gtttcacaag ctgagcctgt tgaaatcagg 
gctttgcagt ccattaataa ttcccactca 
ttgagttctg caagaatggt tactacagta 
ctgagtgttt ccagttccac tcacttcacc 
gatgaggtgg aaattgccga agccatccag 
caggacagct tcttgcaggc atctgttccc 
tcccctgagt gcacaatcca tttagagaaa 
agtgccagtt cagaggacat ttctgagaga 
tcaacaacaa caacaacaac aacagagcaa 
ccccacagtc agtgtttgaa ctcctctcct 
gccttgtcaa ccccttcttc ttctacccca 
tctaagcata gacttcaggg attcattccc 
cagcgcaagt tttctctaca attccacaga 
ctttccccag tctttactca cftcaagaccc 
ccatctcgac ctaccccagg taatacaagt 
acacttgatc tgaacagtag ttccaaatgt 
agtaatgctg ttatacccag tgacgagaca 
ttagatcrtca atacagagct caactccagt 
tcaagtgata caacagtaac ttttaagtca 
gaaaatgatg atacctacaa agatgatgtg 



tcgtcgggat tcccgggcgc cagggctacg 60 
ctcaaggcga gcagcgcgcg cgcggctgcc 120 
ggccgcgagc gggcggactg gcggcggcgg 180 
gaccagctgc ctgagcagcc gctcttcctt 240 
ccgtcgccgg agcccgcgga cgcagcgggg 300 
ccgccgcccc acggagccgc cagccggcgc 360 
ccggacagcg gcgcctcgag tcccgcagcg 420 
gccgagccgt ctcctgcagc ggcccccgcc 480 
aaagggttgc acaagatgga tgatcgtcca 540 
gcaacctgta tgccagcctg gaagcacgaa 600 
gtggtggtaa aaccaatccc agttaaagga 660 
gagtctccag gagaggtcca ggcaagtgcg 720 
ccttctcctg gcaactcccc atcaggtcgc 780 
agaaaaagag tttccccagt gccttttcag 840 
ccttcaccag atggcttctc accatatagc 900 
gtgatgcggg ccagactcrta cttactgcag 960 
ggagacagcc cagacaataa ataccgggtg 1020 
cgtggaacat tctgtattca tctgctattt 1080 
tcagacccaa tgttatggag aaaaacttta 1140 
aaatatcaca gtaggcgtag ctcaaggatc 1200 
tttgtttcac gcatgtcaaa ttctcataca 1260 
tcagaaaaca gcataaagga tgaagaggaa 1320 
cttgatgaag aaagtcttac agtgtgtgaa 1380 
tgcatgtcaa tttgggcaga agagtgtaga 1440 
tgtagatcta agtggagatc tcatgatttc 1500 
tccccttctt ccctcagagc tgcacagcag 1560- 
tcacgeiagga atcaagagag caattttaac 1620 
cctgcttaca aagatttagc tgagccatgg 1680 
tgcttatttt ctagaaactg gaatgtgaga 1740 
gtcagtgggg ccctgctgtt ggcaaatggg 1800 
ggaagcagcc cgagtggggg agccaccagt 1860 
gtggtggagg catgctgcag cgttctgtca 1920 
tacgttgctg ctttaaaaac attgagagcc 1980 
gcggaaagaa tcaaacttca gagacttctc 2040 
tgtgcagatg ccaatagccg cacaagtcag 2100 
aaaggccaag caggagagtt ggcagttggc 2160 
attggtggtg ttgattatgt cttaaattgt 2220 
tggcaagaac ttcttggccg cctttgtctt 2280 
gaattttatc ctcatattgt cagtactgat 2340 
tataagaagc tgctgtccct cttaaccttt 2400 
atggttggca aactttccag aaggatctac 2460 
ccccatgtgt tttcaaaact gttagaaatg 2520 
aggatgcgtc gccgtttgat ggctattaca 2580 
ttgggcgtag ciagacacttt ggatggtcaa 2640 
aacaactatc tggaaaccac agagaacagt 2700 
actggaaaag gattatgtgc tacaaaattg 2760 
ctggccagca tttcagtagg accttctagt 2820 
ccaaagccaa tggttcaaac aaaaggcaga 2880 
ttatctcatc attcccaatt aatgtttcca 2940 
tctgtaccag ctggcactgc aacagatgtc 3000 
tgcagaatac cttctgcatc tcctcaaaca 3060 
aactgtcctg aaaacaaaga ctcagataaa 3120 
ttgccctcca gtaacataca caggccaaag 3180 
aaacagggag atccctcacia aaatagcatg 3240 
gatgacagct ttggctgtag cagcaatagt 3300 
gtgttcaccc cagtagagga gaaatgcaga 3360 
attgaggacc ttcttgaagc atctatgcct 3420 
gaagttgctg tcctgtctcc tgaaaaggct 3480 
aatcataatc aaaagtgcaa agagaagatg 3540 
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gaagctgaag aagaagaagc tttagcaatt gccatggcaa tgtcagcgtc tcaggatgcc 3600 
ctccccatag ttcctcagct gcaggttgaa aatggagaag atatcatcat tattcaacag 3660 
gatacaccag agactctacc aggacatacc aaagcaaaac aaccgtatag agaagacact 3720 
gaatggctga aaggtcaaca gataggcctt ggagcatttt cttcttgtta tcaggctcaa 3780 
gatgtgggaa ctggaacttt aatggctgtt aaacaggtga cttatgtcag aaacacatct 3840 
tctgagcaag aagaagtagt agaagcacta agagaagaga taagaatgat gagccatctg 3900 
aatcatccaa acatcattag gatgttggga gccacgtgtg agaagagcaa ttacaatctc 3960 
ttcattgaat ggatggcagg gggatcggtg gctcatttgc tgagtaaata tggagccttc 4020 
aaagaatcag tagttattaa ctacactgaa cagttactcc gtggcctttc gtatctccat 4080 
gaaaaccaaa tcattcacag agatgtcaaa ggtgccaatt tgctaattga cagcactggt 4140 
cagagactaa gaattgcaga ttttggagct gcagccaggt tggcatcaaa aggaactggt .4200 
gcaggagagt ttcagggaca attactgggg acaattgcat ttatggcacc tgaggtacta 4260 
agaggtcaac agtatggaag gagctgtgat gtatggagtg ttggctgtgc tattatagaa 4320 
atggcttgtg caaaaccacc atggaatgca gaaaaacact ccaatcatct tgctttgata 4380 
tttaagattg ctagtgcaac tactgctcca tcgatccctt cacatttgtc tcctggttta 4440 
cgagatgtgg ctcttcgttg tttagaactt caacctcagg acagacctcc atcaagagag 4500 
ctactgaagc atccagtctt tcgtactaca tggtagccaa ttatgcagat caactacagt 4560 
agaaacagga tgctcaacaa gagaaaaaaa acttgtgggg aaccacattg atattctact 4620 
ggccatgatg ccactgaaca gctatgaacg aggccagtgg ggaaccctta cctaagtatg 4680 
tggattgaca aatcatgatc tgtacctaag ctcagtatgc aaaagcccaa actagtgcag 4740 
aaactgtaaa ctgtgccttt caagaactgg cctaagtgaa ccaggaaaac aatgaagttt 4800 
gctgacttaa tttgaaagct attttttctc ctggaccctt tttcgaaaaa ttacgcta 4858 

<210> 27 

<211> 2903 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc.featiire 

<223> Incyte ID No: 7482905CB1 

.<400> 27 

tatgacgtcg cctgtacagc ggtaccgtga gcttcgagta gttcgtgcat ctgggaccgt 60 
tattccatac taacgtcctg tgtcactgag ttttttaaat gtctagcata tctgtaaaga 120 
tgccttagaa aaagaatcat ggagaagtat gttagactac agaagattgg agaaggttca 180 
tttggaaaag ccattcttgt taaatctaca gaagatggca gacagtatgt tatcaaggaa 240 
attaacatct caagaatgtc cagtaaagaa agagaaggct ggaatttatt gaciciaagaaa 300 
agaaacaaaa ggatcagatt attagtttaa tgaaggctga acaaatgaaa aggcaagaaa 360 
aggaaaggtt ggaaagaata aatagggcca gggaiacaagg atggagaaat gtgctaagtg 420 
ctggtggaag tggtgaagta aaggctcctt ttctgggcag tggagggact atagctccat 480 
catctttttc ttctcgagga cagtatgaac attaccatgc catttttgac caaatgcagc 540 
aacaaagagc agaagataat gaagctaaat ggaaaagaga aatatatggt cgaggtcttc 600 
cagaaaggca aaaagggcag ctagctgtag aaagagctaa acaagtagaa gagttcctgc 660 
agcgaaaacg ggaagctatg cagaatciaag ctcgagccga aggacatatg gtttatctgg 720 
caagactgag gcaaataaga ctacagaatt tcaatgagcg ccaacagatt aaagccaaac 780 
ttcgtggtga aaagaaagaa gctaatcatt ctgaaggaca agaaggaagt gaagaggctg 840 
acatgaggcg caaaaaaatc gaatcactga aggcccatgc aaatgcacgt gctgctgtac 900 
taaaagaaca actagaacga aagagaaagg aggcttatga gagagaaaaa aaagtgtggg 960 
aagagcattt ggtggctaaa ggagttaaga gttctgatgt ttctccacct ttgggacagc 1020 
atgaaacagg tggctctcca tcaaagcaac agatgagatc tgttatttct gtaacttcag 1080 
ctttgaaaga agttggcgtg gacagtagtt taactgatac ccgggaaact tcagaagaga 1140 
tgcaaaagac caacaatgct atttcaagta agcgagaaat acttcgtaga ttaaatgaaa 1200 
atcttaaagc tcaagaagat gaaaaaggaa agcagaatct ctctgatact tttgagataa 1260 
atgttcatga agatgccaaa gagcatgaaa aagaaaaatc agtttcatct gatcgcaaga 1320 
agtgggaggc aggaggtcaa cttgtgattc ctctggatga gttaacacta gatacatcct 1380 
tctctacaac tgaaagacat acagtgggag aagttattaa attaggtcct aatggatctc 1440 
caagaagagc ctgggggaaa agtccgacag attctgttct aaagatactt ggagaagctg 1500 
aactacaact tcagacagaa ctattagaaa atacaactat tagaagtgag atttctcccg 1560 
aaggggaaaa gtacaaaccc ttaattactg gagaaaaacia agtacaatgt atttcacatg 1620 
aaataaaccc atcagctatt gttgattctc ctgttgagac aaaaagtccc gagttcagtg 1680 
aggcatctcc acagatgtca ttgaaactgg aaggaaattt agaagaacct gatgatttgg 1740 
aaacagaaat tctacaagag ccaagtggaa caaacaaaga tgagagcttg ccatgcacta 1800 
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ttactgatgt gtggattagt gaggaaaaag aaacaaagga aactcagtcg gcagatagga 1860 
tcaccattca ggaaaatgaa gtttctgaag atggagtctc gagtactgtg gaccaactta 1920 
gtgacattca tatagagcct ggaaccaatg attctcagca ctctaaatgt gatgtagata 1980 
agtctgtgca accggaacca tttttccata aggtggttca ttctgaacac ttgaacttag 2040 
tccctcaagt tcaatcagtt cagtgttcac cagaagaatc ctttgcattt cgatctcact 2100 
cgcatttacc accaaaaaat aaaaacaaga attccttgct gattggactt tcaactggtc 2160 
tgtttgatgc aaacaaccca aagatgttaa ggacatgttc acttccagat ctctcaaagc 2220 
tgttcagaac ccttatggat gttcccaccg taggagatgt tcgtcaagac aatcttgaaa 2280 
tagatgaaat tgaagatgaa aacattaaag aaggaccttc tgattctgaa gacattgtgt 2340 
ttgaagaaac tgacacagat ttacaagagc tgcaggcctc gatggaacag ttacttaggg 2400 
aacaacctgg tgaagaatac agtgaagaag aagagtcagt cttgaagaac agtgatgtgg 2460 
agccaactgc aaatgggaca gatgtggcag atgaagatga caatcccagc agtgaaagtg 2520 
ccctgaacga agaatggcac tcagataaca gtgatggtga aattgctagt gaatgtgaat 2580 
gcgatagtgt ctttaaccat ttagaggaac tgagacttca tctggagcag gaaatgggct 2640 
ttgaaaaatt ctttgaggtt tatgagaaaa taaaggctat tcatgaagat gaagatgaaa 2700 
atattgaaat ttgttcaaaa atagttcaaa atattttggg aaatgaacat cagcatcttt 2760 
atgccaagat tcttcattta gtcatggcag atggagccta ccaagaagat aatgatgaat 2820 
aatcctcaaa atgtttttta atcctcaact atatgaaagc atttgaattt ggcttatcag 2880 
aataacagct tcagtgggag gcg 2903 

.<210> 28 
<211> 1812 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7483019CB1 

<400> 28 

cttttccttc cctgtgccca ggccttgctc agtgcccatg acacgatagc tcagaaagat 60 
tttgaacccc ttctccctcc actgccagac aatatccctg agagtgagga agcaatgagg 120 
attgtttgtt tagtgaaaaa ccaacagccc ctgggagcca ccatcaagcg ccacgagatg 180 
acaggggaca tcttggtggc caggatcatc cacggtgggc tggcggagag aagtgggttg 240 
ctatatgctg gagacaaact ggtagaagtg aatggagttt cagttgaggg actggaccct 300 
gaacaagtga tccatattct ggccatgtct cgaggcacaa tcatgttcaa ggtggttcca 360 
gtctctgacc ctcctgtgaa tagccagcag atggtgtacg tccgtgccat gactgagtac 420 
tggccccagg aggatcccga catcccctgc atggacgctg gattgccttt ccagaagggg 480 
gacatcctcc agattgtgga ccagaatgat gccctctggt ggcaggcccg aaaaatctca 540 
gaccctgcta cctgcgctgg gcttgtccct tctaaccacc ttctgaagag gaagcaacgg 600 
gaattctggt ggtctcagcc gtaccagcct cacacctgcc tcaagtcaac cctatacaag 660 
gaggagtttg ttggctacgg tcagaagttc tttatagctg gcttccgccg cagcatgcgc 720 
ctttgtcgca ggaagtctca cctcagcccg ctgcatgcca gtgtgtgctg caccggcagc 780 
tgctacagtg cagtgggtgc cccttacgag gaggtggtga ggtaccagcg acgcccttca 840 
gacaagtacc gcctcatagt gctcatggga ccctctggtg ttggagtaaa tgagctcaga 900 
agacaactta ttgaatttaa tcccagccat tttCcLaagtg ctgtgccaca cactactcgt 960 
actaaaaaga gttacgaaac gaatgggcgt gagtatcact atgtgtccaa ggaaacattt 1020 
gciaaacctca tatatagtca caggatgctg gagtatggtg agtacaaagg ccacctgtat 1080 
ggcactagtg tgggtgctgt tcaaacagtc cttgtcgaag gaaagatctg tgtcatggac 1140 
ctagagcctc aggatattca aggggttcga acccatgaac tgaagcccta tgtcatattt 1200 
ataaagccat cgaatatgag gtgtatgaaa caatctcgga aaaatgccaia ggttattact 1260 
gactactatg tggacatgaa gttcaaggat gaagacctac aagagatgga aaatttagcc 1320 
caaagaatgg aaactcagtt tggccaattt tttgatcatg tgattgtgaa tgacagcttg 1380 
cacgatgcat gtgcccagtt gttgtctgcc atacagaagg ctcaggagga gcctcagtgg 1440 
gtaccagcaa catggatttc ctcagatact gagtctcaat gagacttctt gtttaatgct 1500 
ggagttttaa cactgtaccc ttgatacagc gatccatagt tgcaatctaa aacaacagta 1560 
tttgacccat tttaatgtgt acaactttaa aagtgcagca atttattaat taatcttatt 1620 
tgaaaaaaat ttttattgta tggttatgtg gttacctatt ttaacttaat tttttttcct 1680 
ttacctcata tgcagctgtg gtagaaatat ' gaataatgtt aggtcactga gtatgagaac 1740 
ctttcgcaga tttcacatga tctttttaag atttaaataa agagctttcc taaataaaaa 1800 
aaaaaciaaaa gg 1812 

<210> 29 
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<211> 5480 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feat\ire 

<223> Incyte ID No: 5455490CB1 

<400> 29 

ggtgtttcgg aagatcatgt tttttgaaga 
gaagctttta taaatttcct ttggctgaca 
cctaccaaga ctccagtgtg agggcaaaaa 
ggcgagagag actgggagca ccatgtctgc 
ctgaagtaaa ccagcacatg ttttcaccca 
tcccatttag tgctgattgt gctttggcta 
gagcccacag cagtcctggc actccttgtt 
caagtaaccg caagagcttg attgtgacct 
actcaccact ccatggccac acaggtaaca 
caaatgcacc tgctcacttt tcttttgttc 
ggactgatgg gcggcgctgg tctttggcct 
ctcctagctc cactgtctca tcatcatgct 
tccagcctac agctgatgag ctgcactttt 
cagatgagga aggacggcag tccccagcca 
gacgatcccc agtatccttt gacagtgaaa 
gattcccaaa ggccaccgca caaatggaag 
ctccagacag cgtgctgccc ttggcagatg 
ttgagatggc ccgagactgc ctggataaat 
tctacgaact tcaagagaat ttggagaaac 
gctcagaagt ggcttttgtg atgcagctgg 
cagcacgtct cctggaatgc ctggagtttg 
cagctgaggg ccacgccaaa gagggacaag 
ttagccagct gggcctcacc cgggatcccc 
acagtcctga cactccagag acagatgatt 
ctaaaaagac accctctgaa gaggacttcg 
atggggctgt atttctggtg cggcacaagt 
tcaacaagca gaacctgatc ctacggaacc 
tactgacttt cgctgagaac ccctttgtgg 
gccacttgtg catggtgatg gagtacgttg 
atattggggc cctgcctgtg gacatggtgc 
tggagtactt acacaactat ggcatcgtgc 
ttacatccat ggggcacatc aagctcacgg 
gtctgacaac gaacttgtat gagggtcata 
agcaggtatg cgggacccca gaatacattg 
ggaagccagt ggactggtgg gccatgggca 
tccctttttt tggagatact ccggaggagc 
tgtggcctga gggtgatgag gcactgcccc 
tccaccagaa ccctctggag agacttggca 
cattctttac tggtctggac tggacaggac 
agttggagtc agaggatgat actagctatt 
tggactcgga ggatgaggaa gaagtgagtg 
cttcctgctc tccaaggttc aacaaggtgt 
aggagcgccg gacaccaccc ccgaccaagc 
cagatggcct ggcagggctc aaaggccgag 
tattacggaa gcggctgtcg gtgtctgagt 
caatgacagt gcgacgccgc tgctcaggcc 
ctgaggaggc cagcagcacc ctcaggaggc 
ccccatctgg agagggggta tctgggcctg 
agctggatga ggaagctgtt ggccggagca 
gccgtgggac ctcacagctg gctgagggag 
tgcgtagggc ccgccaccgg ctgctctctg 
ctgtcaacaa agtgatcaag tccgcctcag 
aacaccacac ctgctccccg ttggccagcc 
catcatcccg ggactcttct ccaagcaggg 



aaagtactta attttttgcc gtaagtttgg 60 
gaactgcata ccccttgtgt gagagaactt 120 
cttgagtagc caggagaatg atgaaacgga 180 
ggattcaaat ctctactctt tgccgaggag 240 
catctgctcc agccctcttc ctcactaaag 300 
cttctcctct tgccattttc ctgaacccac 360 
ccagccgccc actgccgtgg agttgtcgga 420 
ctagcacatc acctacacta ccacggccac 480 
gtcctttgga cagcccccgg aatttctctc 540 
ctgcccgtag ccatagccac agagctgaca 600 
ctttgccctc ttcaggatat ggaactaaca 660 
cctcacagga aaagctgcat cagttgcctt 720 
tgacgaagca tttcagcaca gagagcgtac 780 
tgcggcctcg ctcccggagc ctcagtcccg 840 
taataatgat gaatcatgtt tacaaagaaa 900 
agcgactagc agagtttatt tcctccaaca 960 
gagccctgag ctttattcat catcaggtga 1020 
ctcggagtgg cctcattaca tcacaatact 1080 
ttttacaaga tgctcatgag cgctcagaga 1140 
tgaaaaagct gatgattatc attgcccgcc 1200 
accctgaaga gttctaccac cttttagaag 1260 
ggattaaatg tgacattccc cgctacatcg 1320 
tagaagaaat ggcccagttg agcagctgtg 1380 
ctattgaggg ccatggggca tctctgccat 1440 
agaccattaa gctcatcagc aatggcgcct 1500 
ccacccggca gcgctttgcc atgaagaaga 1560 
agatccagca ggccttcgtg gagcgtgaca 1620 
tcagcatgtt ctgctccttt gataccaagc 1680 
aagggggaga ctgtgccact ctgctgaaga 1740 
gtctatactt tgcggaaact gtgctggccc 1800 
accgrtgacct caagcctgac aacctcctaa 1860 
actttggact gtccaaaatt ggcctcatga 1920 
ttgaaaagga tgcccgggaa ttcctggaca 1980 
cgcctgaggt gatcctgcgc cagggctatg 2040 
ttatcctgta tgagttcctg gtgggctgcg 2100 
tctttgggca ggtgatcagt gatgagattg 2160 
cagacgccca ggacctcacc tccaaactgc 2220 
caggcagtgc ctatgaggtg aagcagcacc 2280 
ttctccgcca gaaggctgaa tttattcctc 2340 
ttgacacccg ctcagagcga taccaccaca 2400 
aggatggctg ccttgagatc cgccagttct 2460 
acagcagcat ggagcggctc tcactgctcg 2520 
gcagcctgag tgaggagaag gaggaccatt 2580 
accggagctg ggtgattggc tcccctgaga 2640 
catcccacac agagagtgac tcaagccctc 2700 
tcctggatgc gcctcggttc ccggagggcc 2760 
aaccacagga gggtatatgg gtcctgacac 2820 
tcactgaaca ctcaggggag cagcggccaa 2880 
gtggttccag tccagctatg gagacccgag 2940 
ccacagccaa ggccatcagt gacctggctg 3000 
gggactcaac agagaagcgc actgctcgcc 3060 
ccacagccct ctcactcctc attccttcgg 3120 
ccatgtcccc acattctcag tcgtccaacc 3180 
acttcttgcc agcccttggc agcatgaggc 3240 
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ctcccatcat catccaccga gctggcaaga agtatggctt caccctgcgg gccattcgcg 3300 
tctacatggg tgactccgat gtctacaccg tgcaccatat ggtgtggcac gtggaggatg 3360 
gaggtccggc cagtgaggca gggcttcgtc aaggtgacct catcacccat gtcaatgggg 3420 
aacctgtgca tggcctggtg cacacggagg tggtagagct gatcctgaag agtggaaaca 3480 
aggtggccat ttcaacaact cccctggaga acacatccat taaagtgggg ccagctcgga 3540 
agggcagcta caaggccaag atggcccgcia ggagcaagag gagccgcggc aaggatgggc 3600 
aagaaagcag aaaciaggagc tccctgttcc gcaagatcac caagcaagca tccctgctcc 3660 
acaccagccg cagcctttcc tcccttaacc gctccttgtc atcaggggag agtgggccag 3720 
gctctcccac acacagccac agcctttccc cccgatctcc cactcaaggc taccgggtga 3780 
cccccgatgc tgtgcattca gtgggaggga attcatcaca gagcagctcc cccagctcca 3840 
gcgtgcccag ttccccagcc ggctctgggc acacacggcc cagctccctc cacggtctgg 3900 
cacccaagct ccaacgccag taccgctctc cacggcgcaa gtcagcaggc agcatcccac 3960 
tgtcaccact ggcccacacc ccttctcccc cacccccaac agcttcacct cagcggtccc 4020 
catcgcccct gtctggccat gtagcccagg cctttcccac aaagcttcac ttgtcacctc 4080 
ccctgggcag gcaactctca cggcccaaga gtgcggagcc accccgttca ccactactca 4140 
agagggtgca gtcggctgag aaactggcag cagcacttgc cgcctctgag aagaagctag 4200 
ccacttctcg caagcacagc cttgacctgc cccactctga actaaagaag gaactgccgc 4260 
ccagggaagt gagccctctg gaggtagttg gagccaggag tgtgctgtct ggcaaggggg 4320 
ccctgccagg gaagggggtg ctgcagcctg ctccctcacg ggccctaggc accctccggc 4380 
aggaccgagc cgaacgacgg gagtcgctgc agaagcaaga agccattcgt gaggtggact 4440 
cctcagagga cgacaccgag gaagggcctg agaacagcca gggtgcacag gagctgagct 4500 
tggcacctca cccagaagtg agccagagtg tggcccctaa aggagcagga gagagtgggg 4560 
aagaggatcc tttcccgtcc agagacccta ggagcctggg cccaatggtc ccaagcctat 4620 
tgacagggat cacactgggg cctcccagaa tggaaagtcc cagtggtccc cacaggaggc 4680 
tcgggagccc acaagccatt gaggaggctg ccagctcctc ctcagcaggc cccaacctag 4740 
gtcagtctgg agccacagac cccatccctc ctgaaggttg ctggaaggcc cagcacctcc 4800 
acacccaggc actaacagca ctttctccca gcacttcggg actcaccccc accagcagtt 4860 
gctctcctcc cagctccacc tctgggaagc tgagcatgtg grtcctggaaa tcccttattg 4920 
agggcccaga cagggcatcc ccaagcagaa aggcaaccat ggcaggtggg ctagcceiacc 4980 
tccaggattt ggaaaacaca actccagccc agcctaagaa cctgtctccc agggagcagg 5040 
ggaagacaca gccacctagt gcccccagac tggcccatcc atcttatgag gatcccagcc 5100 
agggctggct atgggagtct gagtgtgcac aagcagtgaa agaggatcca gccctgagca 5160 
tcacccaagt gcctgatgcc tcaggtgaca gaaggcagga cgttccatgc cgaggctgcc 5220 
ccctcaccca gaagtctgag cccagcctca ggaggggcca agaaccaggg ggccatcaaa 5280 
agcatcggga tttggcattg gttccagatg agcttttaaa gcaaacatag cagttgtttg 5340 
ccatttcttg cactcagacc tgtgtaatat atgctcctgg aaaccatctt tatgtctttt 5400 
gcttgcttgt tttccttcgg tcaacccaca tgtaactagg tcctgtgttg ctgctgggaa 5460 
tatagtggtg aataaagcat 5480 

<210> 30 

<211> 1568 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID Nd: 5547067C31 

<400> 30 

caggaaaaaa agatatttta aatttgatgc tcatttttgt gtgtgtgtgt tgagtgcatg 60 
cattcaatct gtgtattcct ccctcatcaa cccagaacat cctgcagctg ccactctgag 120 
ggtggcccct tccttcctct gccctgaagc tgtatcacag agatttctag tcctagtgtg 180 
actctggccc catgctgatg ggtttctgca gactggaaga ggcggggctc gtgtcacgca 240 
gcatcaggga gaggaattgc ttatataact gggacagcag atttagcaga gagaggaggc 300 
agaggctggg aatgggagca gtaagctgtc ggcaggggca gcacacccag cagggggaac 360 
acacccgggt ggctgtccct cacaagggtg gcaacatccg gggtccctgg gcccgaggct 420 
ggaagagcct ctggacaggt ttgggaacca tcaggtcaga tctggaagaa ctctgggaac 480 
tacgggggca ccactatctg caccaggaat ccctaaagcc agccccagta ctggtagaga 540 
agcctctgcc agagtggcca gtgcctcagt tcatcaacct ctttctacca gagtttccca 600 
ttaggcccat tagggggcag cagcagctga agattttagg cctcgtggct aaaggctcct 660 
ttggaactgt cctcaaggtg ctagattgca cccagaaagc tgtatttgca gtgaaggtgg 720 
tgcccaaggt aaaggtccta cagagggata ccgtgaggca gtgcaaagag gaggttagca 780 
tccagcgaca gatcaaccat ccctttsftac acagcttggg ggacagctgg cagggaaaac 840 
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ggcacctttt cattatgtgt 
ttggctgctt tcctgaggct 
gttatctcca tgacttgggc 
atgaacgagg ccatctgaaa 
ctcaagccta cactatctgt 
gaccttacaa ccatgctgct 
ctggaaagtt tccagtggct 
acagtgactc tgagatccca 
tcttatgcca gaaccccctc 
tctttcgggg tgtggccttc 
cggagacaca agctacccag 
gtgatctgga gtccttcttg 
gggcccgg 

<210> 31 
<211> 2365 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> miscfeatvire 
<223> Incyte ID No: 71675660CB1 

<400> 31 

cccctttttt tttttttcta ggaaagaagg gagtttatca ctgtaactgg atacagggag 60 
aaggctggag ataattccag cagaccaact caaagtgcta caattttctt actgtttata 120 
taggttgggg ttatgtgcct acatgcagta cagcaatcac cteiagtctac tggtaactaa 180 
ttttgttcca aggagaaggt cagaggcaaa aaaaatgctt gctaagtccg attaaaaggg 240 
gcccagtgcc ttcaaggcct gtctactgtg gtaccggagt gattatttcg attgtatctc 300 
ctttacagct tggtccagag agctgcctta gactatccaa ttgatctatt caaacagctg 360 ' 
cctgttccct taacttgtct tcagattttg tcgacccgag atgggtcctg gcactaggaa 420 
tgtaaaaccg ttcctattat tttggcttgc tccagcaaaa gagaagccca tgcaaggctc 480 
ctgctgacca tgtttcattt ctagctttga tgtctgggca ctgatttccc tagatttaac 540 
tatgtgctca atggtaaggc agtgctgtgg aaatctgtct -gtgtaactgg ggtgctatgc 600 
aggcctgtct gggtgactgt cagggacaac tgtcctacca caccaaggac acagccctgg 660 
gggtgctttt cttcatagcc aaagaagctg caggaaaccc accctagtgg gacaaagacc 720 
aatgcagggt cagtccccac agccaggtga tgcaaacagg ctggacgtgg gccgcctccc 780 
ctccagcttg acttgtgaca gggaaaccaa tgcagcagca gcagggccac cagagtcctg 840 
tcctggggac aggcttcctt ccagcgggcg gggagtgggt gctcctgcca gaccagcctg 900 
gcttccacgg ttccagagac cctgttcccc ctcagcccag tccccgcccc cactccttgg 960 
ctttatgagt tcattggctg aagtcacccg gagacaatgc tgagtgttcc acccctgagt 1020 
cgaagcccag cccagggcag cccagccaga cgcctccggt agtgtaaatg aggacaatgc 1080 
ctgctggccc acatgacggg gggatgtaga cggcagcggc gccagtcgct cctggcacca 1140 
tggacgatgc cacagtccta aggaagaagg gttacatcgt aggcatcaat cttggcaagg 1200 
gttcctacgc aaaagtcaaa tctgcctact ctgagcgcct caagttcaat gtggctgtca 1260 
agatcatcga ccgcaagaeia acacctactg actttgtgga gagattcctt cctcgggaga 1320 
tggacatcct ggcaactgtc aaccacggct ccatcatcaa gacttacgag atctttgaga 1380 
cctctgacgg acggatctac atcatcatgg agcttggcgt ccagggcgac ctcctcgagt 1440 
tcatcaagtg ccagggagcc ctgcatgagg acgtggcacg caagatgttc cgacagctct 1500 
cctccgccgt caagtactgc cacgacctgg acatcgtcca ccgggacctc aagtgcgaga 1560 
accttctcct cgacaaggac ttcaacatca agctgtctga ctttggcttc tccaagcgct 1620 
gcctgcggga cagcaatggg cgcatcatcc tcagcaagac cttctgcggg tcggcagcat 1680 
atgcagcccc cgaggtgctg cagagcatcc cctaccagcc caaggtgtat gacatctgga 1740 
gcctgggcgt gatcctgtac atcatggtct gtggctccat gccctatgac gactccgaca 1800 
tcaggaagat gctgcgtatc cagaaggagc accgtgtgga cttcccgcgc tccaagaacc 1860 
tgacctgcga gtgcaaggac ctcatctacc gcatgctgca gcccgacgtc agtcagcggc 1920 
tccacatcga tgagatcctc agccactcgt ggctgcagcc ccccaagccc aaagccatgt 1980 
cttctgcctc cttcaagagg gagggggagg gcaagtaccg cgctgagtgc aaactggaca 2040 
ccaagacagg cttgaggccc gaccaccggc ccgaccacaa gcttggagcc aaaacccagc 2100 
accggctgct ggtggtgccc gagaacgaga acaggatgga ggacaggctg gccgagacct 2160 
ccagagccaa agaccatcac atctccggag ctgaggtggg gaaagcaagc acctagcatg 2220 
acaatggccc cgttgtgtgt ggtgggggtc ggggttgggg ggcatggtgc agtcggcctt 2280 
cacgtaaact aagtaggcag gtaggatctg aagaaggcac aggtgcaagt aaaattcgtc 2340 



agctactgca gcacagatct 
tccatccgtc tctttgctgc 
atcatgcatc gagatgtgaa 
ctgacagact ttggtctgtc 
ggcactcttc agtacatggc 
gattggtggt ccctgggtgt 
gcagagagag atcatgtggc 
gcttctctta accagggcct 
catcgtctac gttatctgca 
gacccagagc tcctacagaa 
cccagttcag cggagaccat 
ctctacccta tccctgcttg 



gtactccctt tggtcggctg 900 
cgagttggtg ctggtactgt 960 
gatggagaat attcttctag 1020 
ccgccacgtg ccccagggag 1080 
cccagaggtc ctaagtggag 1140 
cttgcttttc tctctggcga 1200 
catgttggca agtgtgaccc 1260 
ctcactcctg ctccatgagc 1320 
tcacttccag gtccaccctt 1380 
gcagccagtg aactttgtca 1440 
gccctttgac gactttgact 1500 
agcctctcta ctgtaaattg 1560 

1568 
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aattaaacca ctattttgat taaaa 2365 

<210> 32 

<211> 2626 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 71678683CB1 

<400> 32 

cccctttttt tttttttcta ggaaagaagg gagtttatca ctgtaactgg atacagggag 60 
aaggctggag ataattccag cagaccaact caaagtgcta caattttctt actgtttata 120 
taggttgggg ttatgtgcct acatgcagta cagcaatcac ctaagtctac tggtaactaa 180 
ttttgttcca aggagaaggt cagaggcaaa aaaaatgctt gctaagtccg attaaaaggg 240 
gcccagtgcc ttcaaggcct gtctactgtg gtaccggagt gattatttcg attgtatctc 300 
ctttacagct tggtccagag agctgcctta gactatccaa ttgatctatt caaacagctg 360 
cctgttccct taacttgtct tcagattttg tcgacccgag atgggtcctg gcactaggaa 420 
tgtaaaaccg ttcctattat tttggcttgc tccagcaaaa gagaagccca tgcaaggctc 480 
ctgctgacca tgtttcattt ctagctttga tgtctgggca ctgatttccc tagatttaac 540 
tatgtgctca atggtaaggc agtgctgtgg aaatctgtct gtgtaactgg ggtgctatgc 600 
aggcctgtct gggtgactgt cagggacaac tgtcctafcca caccaaggac acagccctgg 660 
gggtgctttt cttcatagcc aaagaagctg caggaaaccc accctagtgg gacaaagacc 720 
aatgcagggt cagtccccac agccaggtga tgcaaacagg ctggacgtgg gccgcctccc 780 
ctccagcttg acttgtgaca gggaaaccaa tgcagcagca gcagggccac cagagtcctg 840 
tcctggggac aggcttcctt ccagcgggcg gggagtgggt gctcctgcca gaccagcctg 900 
gcttccacgg ttccagagac cctgttcccc ctcagcccag tccccgcccc cactccttgg 960 
ctttatgagt tcattggctg aagtcacccg gagacaatgc tgagtgttcc acccctgagt 1020 
cgaagcccag cccagggcag cccagccaga cgcctccggt agtgtaaatg aggacaatgc 1080 
ctgctggccc acatgacggg gggatgtaga cggcagcggc gccagtcgct cctggcacca 1140 
tggacgatgc cacagtccta aggaagaagg gttacatcgt aggcatcaat cttggcaagg 1200 
gttcctacgc aaaagtcaaa tctgcctact ctgagcgcct caagttcaat gtggctgtca 1260 
agatcatcga ccgcaagaaa acacctactg actttgtgga gagattcctt cctcgggaga 1320 
tggacatcct ggcaactgtc aaccacggct ccatcatcaa gacttacgag atctttgaga 1380 
cctctgacgg acggatctac atcatcatgg agcttggcgt ccagggcgac ctcctcgagt 1440 
tcatcaagtg ccagggagcc ctgcatgagg acgtggcacg caagatgttc cgacagctct 1500 
cctccgccgt caagtactgc cacgacctgg acatcgtcca ccgggacctc aagtgcgaga 1560 
accttctcct cgacaaggac ttcaacatca agctgtctga ctttggcttc tccaagcgct 1620 
gcctgcggga cagcaatggg cgcatcatcc tcagcaagac cttctgcggg tcggcagcat 1680 
atgcagcccc cgaggtgctg cagagcatcc cctaccagcc caaggtgtat gacatctgga 1740 
gcctgggcgt gatcctgtac atcatggtct gcggctccat gccctatgac gactccgaca 1800 
tcaggaagat gctgcgtatc cagaaggagc accgtgtgga cttcccgcgc tccaagciacc 1860 
tgacctgcga gtgcaaggac ctcatctacc gcatgctgca gcccgacgtc agccagcggc 1920 
tccacatcga tgagatcctc agccactcgt ggctgcagcc ccccaagccc aaagccacgt 1980 
cttctgcctc cttcaagagg gagggggagg gcaagtaccg cgctgagtgc aaactggaca 2040 
ccaagacagg cttgaggccc gaccaccggc ccgaccacaa gcttggagcc aaaacccagc 2100 
accggctgct ggtggtgccc gagaacgaga acaggatgga ggacaggctg gccgagacct 2160 
ccagggccaa agaccatcac atctccggag ctgaggtggg gaaagcaagc acctagcatg 2220 
acaatggccc cgttgtgtgt ggtgggggtc ggggttgggg ggcatggtgc agtcggcctt 2280 
cacgtaaact aagtaggcag gtaggatctg aagaaggcac aggtgcaagt aaaattcgtc 2340 
aattaaacca ctattttgat tacgttccat tagctttctt ccacttagca gcaaagacgt 2400 
tccttactga ccaccaaata aaccacaggg tgtgtgcaag catcaagagt gcccagtgag 2460 
gagtgttttt ctctgggact cagccaaccg ccccacctga cacacagtgg tctccggcct 2520 
aggagcacag gacagatgct caggtacagg cagaatcaca gtgtggcctg gccttgtggg 2580 
ggacaagagg gcctctgcca gggtccaccc accaggccca cactgt 2626 

<210> 33 
<211> 3961 
<212> nNA 

<213> Homo sapiens 
<220> 
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<221> misc^feature 

<223> Incyte ID No: 7474567CB1 

<400> 33 

ccctgtaata cgaactcact atagggcgac cagtgtgctg gaaagcggcc gcgggggcgg 60 
cggaggatat ggagtaaagc cagagtcagt ggccaggcac gaaggcagag caggaacagc 120 
caggaggcgt ttattagggg ggcgggggga aagagcccca gcaccgcccc tcctggaaga 180 
aggaagaggt aactataact acccaatatt gcagccatgg agtccatgct taataaattg 240 
aagagtactg ttacaaaagt aacagctgat gtcactagtg ctgtaatggg aaatcctgtc 300 
actagagaat ttgatgttgg tcgacacatt gccagtggtg gcaatgggct agcttggaag 360 
atttttaatg gcacaaaaaa gtcaacaaag caggaagtgg cagtttttgt ctttgataaa 420 
aaactgattg acaagtatca aaaatttgaa aaggatcaaa tcattgattc tctaaaacga 480 
ggagtccaac agttaactcg gcttcgacac cctcgacttc ttactgtcca gcatccttta 540 
gaagaatcca gggattgctt ggcattttgt acagaaccag tttttgccag tttagccaat 600 
gttcttggta actgggaaaa tctaccttcc cctatatctc cagacattaa ggattataaa 660 
ctttatgatg tagaaaccaa atatggtttg cttcaggttt ctgaaggatt gtcattcttg 720 
catagcagtg tgaaaatggt gcatggaaat atcactcctg aaaatataat tttgaataaa 780 
agtggagcct ggaaaataat gggttttgat ttttgtgtat catcaaccaa tccttctgaa 840 
caagagccta aatttccttg taaagaatgg gacccaaatt taccttcatt gtgtcttcca 900 
aatcctgaat atttggctcc tgaatacata ctttctgtga gctgtgaaac agccagtgat 960 
atgtattctt taggaactgt tatgtatgct gtatttaata aagggaaacc tatatttgaa 1020 
gtcaacaagc aagatattta caagagtttc agtaggcagt tggatcagtt gagtcgttta 1080 
ggatctagtt cacttacaaa tatacctgag gaagttcgtg aacatgtaaa gctactgtta 1140 
aatgtaactc cgactgtaag accagatgca gatcaaatga caaagattcc cttctttgat 1200 
gatgttggtg cagtaacact gcaatatttt gataccttat tccaaagaga taatcttcag 1260 
aaatcacagt ttttcaaagg actgccaaag gttctaccaa aactgcccaa gcgtgtcatt 1320 
gtgcagagaa ttttgccttg tttgacttca gaatttgtaa accctgacat ggtacctttt 1380 
gttttgccca atgttctact tattgctgag gaatgcacca aagaagaata tgtcaaatta 1440 
attcttcctg aacttggccc tgtgtttaag cagcaggagc caatccagat tttgttaatt 1500 
ttcctacaaa aaatggattt gctactaacc aaaacccctc ctgatgagat aaagaacagt 1560 
gttctaccca tggtttacag agcactagaa gctccttcca ttcagatcca ggagctctgt 1620 
ctaaacatca ttccaacctt tgcaaatctt atagactacc catccatgaa aaacgctttg 1680 
ataccaagaa ttaaaaatgc ttgtctacaa acatcttccc ttgcggttcg tgtaaattca 1740 
ttagtgtgct taggaaagat tttggaatac ttggataagt ggtttgtact tgatgatatc 1800 
ctacccttct tacaacaaat tccatccaag gaacctgcgg tcctcatggg aattttaggt 1860 
atttacaaat gtacttttac tcataagaag ttgggaatca ccaaagagca gctggccgga 1920 
aaagtgttgc ctcatcttat tcccctgagt attgaaaaca atcttaatct taatcagttc 1980 
aattctttca tttccgtcat aaaagaaatg cttaatagat tggagtctga acataagact 2040 
aaactggagc aacttcatat aatgcaagaa cagcagaaat ctttggatat aggaaatcaa 2100 
atgaatgttt ctgaggagat gaaagttaca aatattggga atcagcaaat tgacaaagtt 2160 
tttaacaaca ttggagcaga ccttctgact ggcagtgagt ccgaaaataa agaggacggg 2220 
ttacagaata aacataaaag agcatcactt acacttgaag aaaaacaaaa attagcaaaa 2280 
gaacaagagc aggcacagaa gctgaaaagc cagcagcctc ttaaacccca agtgcacaca 2340 
cctgttgcta ctgttaaaca gactaaggac ttgacagaca cactgatgga taatatgtca 2400 
tccttgacca gcctttctgt tagtacccct aaatcttctg cttcaagtac tttcacttct 2460 
gttccttcca tgggcattgg tatgatgttt tctacaccaa ctgataatac aaagagaaat 2520 
ttgacaaatg gcctaaatgc caatatgggc tttcagactt caggattcaa catgcccgtt 2580 
aatacaaacc agaacttcta cagtagtcca agcacagttg gagtgaccaa gatgactctg 2640 
ggaacacctc ccactttgcc aaacttcaat gctttgagtg ttcctcctgc tggtgcaaag 2700 
cagacccaac aaagacccac agatatgtct gcccttaata atctctttgg ccctcagaaa 2760 
cccaaagtta gcatgaacca gttatcacaa cagaaaccaa atcagtggct taatcagttt 2820 
gtacctcctc aaggttctcc aactatgggc agttcagtaa tggggacaca gatgaacgtg 2880 
ataggacaat ctgcttttgg tatgcagggt aatcctttct ttaacccaca gaactttgca 2940 
cagccaccaa ctactatgac caatagcagt tcagctagca atgatttaaa agatcttttt 3000 
gggtgaggtg tcttacttct attttgaagg attatttcag tttcaatcat gggtgagctg 3060 
atttacatct ttatatagtt ggcttggagg eiagtacttct atgggaaagt gaacagttct 3120 
gtgacaggaa acatctctgt ccatgccagc atagtagttg tatggacttc taaccagttg 3180 
agttttttaa agcattgagg attttttcct cttaccaact cctcttcagg tttttaaaga 3240 
cccagccctt cccaatctca aagagaaaaa ggaaactgag ttatcttgeia taacataact 3300 
ttttaatcaa atgtttattt tggcttgtgg atcttggtgt tatttaaaaa attgaggtga 3360 
tggtcattgc aagctcatct attaagtact atatggtaca cagtctatga gtcattagtc 3420 
ttcattttaa tatgtaaaaa atcttgatgc tgtattgatt tgtttgcatt taagatgaca 3480 
gtgagaaaat gataagcata aagagaagta tcaggttatt tgctttttcc aaacttttca 3540 
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gatgaactat tgtttagtac 
tttcattggt ttaaatgcgt 
attttagcat ggtctgcctc 
agaatcactt ttttcctcct 
gacaaggcac tctgatgtga 
acttctgagg ttgcagcata 
ctatatttat gagtaaactt 
a 

<210> 34 
<211> 2210 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<223> Incyte ID No: 3838946CB1 

<400> 34 

ccagagggcc ggcatgtggt ctgcagaaga ggaggacgtg gctgggtggc agggctggtc 60 
tccaaggacg agtaagatcc tgcagctgca ggcttcagga agtctcctgg ggctatcaga 120 
tggctccttc ttgtagcagc agctgtgggg tccactggcc ctgagccctc agaggggcgg 180 
ccgtggggga cctcctgtct tttgccttgc aagggcctca gttgtgcttt ttccctctag 240 
gcagccatgg gtgccaggca gtgctgagag cagtggggca tggctgcagc cctgcaggtc 300 
ctgccccgct tggcccgagc ccccttgcat ccactcctct ggcggggctc agtggcccgt 360 
ctggccagca gcatggcctt ggcagagcag gccaggcagc tgtttgagag tgctgtaggrt 420 
gcagtgctgc cgggccccat gctgcaccgg gcactatcct tggaccctgg tggcagacag 480 
ctgaaggtgc gggaccggaa ctttcagctg aggcaaaacc tctacctggt gggctttggc 540 
aaggctgtgc tgggtatggc agctgcagct gaggaactac tgggccagca tcttgtgcag 600 
ggcgtgatca gcgttcccaa ggggatccgt gctgccatgg agcgtgccgg caagcaggag 660 
atgctgctga agccacatag ccgtgtccag gtattcgagg gtgcggagga caacctcccg 720 
gaccgcgatg cgctgcgggc tgcactggcc atccagcaac tggctgaggg actcacagct 780 
gatgacctgc tgctcgtgct gatctcaggt gggggttcag ctctgctgcc tgcccccatc 840 
ccacctgtca cactggagga gaagcagaca ctcactagac tgctggcagc ccgtggagcc 900 
accatccagg agttgaacac cattcggaag gccctgtccc agctcaaggg tggggggctg 960 
gctcaggccg cctaccctgc ccaggtggtg agcctcatcc tgtcagatgt ggtgggggac 1020 
cctgtggagg tgattgccag tggccccacc gtggccagtt cccacaatgt gcaagattgc 1080 
ctgcatatcc tcaatcgcta cggcctccgt gcagccctgc cacgttctgt gaagactgtg 1140 
ctgtctcggg ccgactctga cccccatggg ccacacacct gtggccatgt cctgaatgtg 1200 
atcattggct ctaatgtgct ggcgctagct gaggcccagc ggcaggccga ggcactgggc 1260 
taccaggctg tggtgctgag tgcagccatg caaggtgatg taaaaagtat ggcccagttc 1320 
tacgggctgc tggcccatgt ggctagaacc cgcctcaccc catccatggc tggggcttct 1380 
gtggaggaag atgcacagct ccatgagctg gcagctgagc ttcagatccc agacctgcag 1440 
ctggaggagg ctctggagac catggcatgg ggaaggggcc cagtctgcct gctggctggt 1500 
ggcgagccca cagtacagct gcagggctcg ggcaggggtg gccggaacca ggaactggcc 1560 
ctgcgtgttg gagcagagtt gagaaggtgg ccgctggggc cgatagatgt gctgtttttg 1620 
agcggtggca ccgatgggca ggatgggccc acagaggctg ctggggcctg ggtcacacct 1680 
gagcttgcca gccaggctgc agctgagggc ctggacatag ccaccttcct agcccacaat 1740 
gactcacata ccttcttctg ctgcctccag ggtggggcac acctgctgca cacagggatg 1800 
acaggtacca atgtcatgga cacccacctc ttgttcctgc ggcctcggtg atggcatagg 1860 
tcacattttg ggagttcaga ggaggcctac aagggcaagg tcagatggca gagcaaggtt 1920 
ggtcctcagg gcctctctaa gccttagggc ccctcctctc cttggccttg gctgtttggt 1980 
taactgtcac cttccactca gggcctctgc tctatatcta ttcccttcca gccagactgg 2040 
cagatggggg cttcccccta cccctgagga tgaggacaag cccctcggcc agttcagcgt 2100 
tcccgtgctt ctcccttggg cagcctctct cttgagcccc tcaccctgtt tctttctgtg 2160 
aagcgagaat gtctgaaaat ciaataggacc atgccaaaaa aaaaaaaaaa 2210 

<210> 35 
<211> 4869 
<212> DNA 

<213> Homo sapiens 
<220> 



agagactgag caaatactac aaaattcaac ttaaccttca 3600 
tattaaccat cttaagtgca aactaatcat tgtaaattat 3660 
aaatagtaat gtatttttct gcattcactt ggatatattt 3720 
gtatcaagga agaggtatgt gctgatttgt ttggatattt 3780 
cttccctgac tactaccttc atatttcatt tcaaattcaa 3840 
tatgaattgc attttcaaaa gaagatttgt aagaattaaa 3900 
ttgaggtttc tgctgtattg tttcaeiatgt aataaacttt 3960 

3961 
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<221> miBC_feature 

<223> Incyte ID No: 72001176CB1 

<400> 35 

ctgcgcttpt cgcgaaacgg caggcatcgc ggggctggcc acttccgtac ttccgctttc 60 
cggcccagcc agcgcccgcg atgactgcca ctctccgccc ctacctgagt gccgtgcggg 120 
ccacattgca ggctgccctc tgcctggaga acttctcctc ccaggttgtg gaacgacaca 180 
acaagccgga agtggaagtc aggagtagca aagagctcct gttacaacct gtgaccatca 240 
gcaggaatga gaaggaaaag gttctgattg agggctccat caactctgtc cgggtcagca 300 
ttgctgtgaa acaggctgat gagatcgaga agattttgtg ccacaagttc atgcgcttca 360 
tgatgatgcg agcagagaac ttctttatcc ttcgaaggaa gcctgtggag gggtatgata 420 
tcagctttct gatcaccaac ttccacacag agcagatgta caaacacaag ttggtggact 480 
ttgtgatcca cttcatggag gagattgaca aggagatcag tgagatgaag ctgtcagtca 540 
atgcccgtgc ccgcattgtg gctgaagagt tccttaagaa tttttaaacc atctggctgg 600 
atctcgtggc cttccccctc agactaccca tgtctccacg aaggcgtcct ggagtcactc 660 
cccgcgctgc tctacccacc cgcccctcgg ctcctcgcct tccccctccc gtccgccttc 720 
tcccctccct cccgctcctg ggaaagagag aaaccaccgc tgcgggtggg tagagaagca 780 
cttggcgcct cggggagggg accgcgcccg cctcatttgc gccttgcagc actgctggac 840 
caggttacaa gatgttcacc taagattgag acctagtgac tacatttcct acgggaacaa 900 
ataaatggtt tttcatctcc cggagataca ttacaaacaa atatggtgct aaaagaactc 960 
cttacctttc tctgactaca atttatttgg acatactttt gtattgaaga gaggtataca 1020 
tactgaagct acttgctgta ctataggaga ctctgtcctg taggatcatg gaccatccta 1080 
gtagggaaaa ggatgaaaga caacggacaa ctaaacccat ggcacaaagg agtgcacact 1140 
gctctcgacc atctggctcc tcatcgtcct . ctggggttct tatggtggga cccaacttca 1200 
gggttggcaa gaagatagga tgtgggaact tcggagagct cagattaggt aaaaatctct 1260 
acaccaatga atatgtagca atcaaactgg aaccaataaa atcacgtgct ctacagcttc 1320 
atttagagta cagattttat aaacagcttg gcagtgcagg tgaaggtctc ccacaggtgt 1380 
attactttgg accatgtggg aaatataatg ccatggtgct ggagctcctt ggccctagct 1440 
tggaggactt gtttgacctc tgtgaccgaa catttacttt gaagacggtg ttaatgatag 1500 
ccatccagtt gctttctcga atggaatacg tgcactcaaa gaacctcatt taccgagatg 1560 
tcaagccaga gaacttcctg attggtcgac aaggcaataa gaaagagcat gttatacaca 1620 
ttatagactt tggactggcc aaggaataca ttgaccccga aaccaaaaaa cacatacctt 1680 
atagggaaca caaaagttta actggaactg caagatatat gtctatcaac acgcatcttg 1740 
gcaaagagca aagccggaga gatgatttgg aagccctagg ccatatgttc atgtatttcc 1800 
ttcgaggcag cctcccctgg caaggactca aggctgacac attaaaagag agatatcaaa 1860 
aaattggtga caccaaaagg aatactccca ttgaagctct ctgtgagaac tttccagagg 1920 
agatggcaac ctaccttcga tatgtcaggc gactggactt ctttgaaaaa cctgattatg 1980 
agtatttacg gaccctcttc acagacctct ttgaaaagaa aggctacacc tttgactatg 2040 
cctatgattg ggttgggaga cctattccta ctccagtagg gtcagttcac gtagattctg 2100 
gtgcatctgc aataactcga gaaagccaca cacataggga tcggccatca caacagcagc 2160 
ctcttcgaaa tcagaatgta tcatcagagc gccgaggaga gtgggaaatt cagcccagcc 2220 
ggcagaccaa tacctcatac ctaacgtctc acttggctgc agaccgccat gggggatcag 2280 
tgcaggtggt tagctcaacc aatggagagc tgaatgttga tgatcccacg ggagcccact 2340 
ccaatgcacc aatcacagct catgccgagg tggaggtagt ggaggaagct aagtgctgct 2400 
gtttctttaa gaggaaaagg aagaagactg ctcagcgcca caagtgacca gtgcctccca 2460 
ggagtcctca ggccctgggg actctgactc aattgtacct gcagctcctg ccatttctca 2520 
ttggaaggga ctcctctttg ggggagggtg gatatccaaa ccaaaaagaa gaaaacagat 2580 
gcccccagaa ggggccagtg cgggcagcca gggcctagtg ggtcattggc catctccgcc 2640 
tgcctaaggc tctgagcagg tcccagagct gctgttcctc cactgcttgc ccatagggct 2700 
gcctggttga ctctccttcc cattgtttac agtgaaggtg tcattcacaa aaactcaagg 2760 
actgctattc tccttcttcc ccttagttta ctcctggttt ttaccccacc ctcaaccctc 2820 
tccagcataa aacctagtga gctaaaggct ttgtctgcag aaggagatca agaggctggg 2880 
ggtaaggcca agaaggtagg aggaaaatgg cagacctggg ctggagaaga accttctccg 2940 
tatcccaggt gtgcctggca gtatggtttc ctcttcctct gtgcctgtgc agcattcatc 3000 
ccagctggcc ttggggttca ggttccttct tccctccctc ctgtgaagtt acactgtagg 3060 
acacaagctg tgagcaatct gcagtctact gtccctgtgt gttggcgttc ttagcttttt 3120 
tgacaaactc ttttctccag gtagtaggac aatgaaaatt gttctaagca aaggaaagaa 3180 
aactgacttt gttgcacttt tagttttttt aaaaaaaacc aaaacaaaac atggcagatg 3240 
catattgtgt ctggttatat tgggggtttt acttttacct gttttgaggg ggatggggcc 3300 
ggccaagcca ttcagagaga acatgggtcc agaggacatt ctcagtggaa agagtttgat 3360 
ctgcagcacc cagaagagaa gccaactcgg tgtcattctg agtgaacact caggttggca 3420 
agaaaacata cttgaatttt cattcatctt ctcagcagct gaagaatgtc cctaccagag 3480 
catcttgacc taatcagctt acagtttgaa aacctagctc tccagaacat gagatgagcc 3540 
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agccgagcca gactgtgacc aggaaacagc tcatcccaga gaaggagatg cttaacaaaa 3600 
aaaaattgaa attgtttccc atgctgccag ggacttccaa ctagatagcc atgtgacgtc 3660 
ctggtgactt gggggaaaaa ttagtgatga aacagccacc accatattgc cattagtgga 3720 
aaaaaagagg acagtgaacc tgccttccac ctgccagagg gacctcaggg tgtggcatta 3780 
tagggccagg aaaagaaaat cggtgtatcc tatctgcccc aatagctgag ctgtagcatt 3840 
tgggctggcc tgccttatca gaaaccaagc ttatgaagat cttctcccag caggtccata 3900 
gcagtaggct taggatgcag tatatggggc cgcatttaaa aggagggaaa gattgtttgg 3960 
tgctggaaca ttccagggaa aaggagactg gaatgaaagg tctgaaatta tcttctcaat 4020 
tggactcctt ccagaaaggt ggccgtgcct ctaagcatgt ttttcccagt atgccctagg 4080 
cctcccccca tggtgttttc atatgaggta ctactgtgaa ggatctggtt cctcattcac 4140 
tgtttgacaa gtctttcatg tgtggagtta ctcttctcat gcccaatttt catttgagtt 4200 
tagtggctta accaaacaat gac tec teat tccagcggtg acagaagaga aagggteatt 4260 
tacatcagga aagaggtctt gtatctggga gtagagagct aaccatggag cacagtggct 4320 
ggtgggtgac ttagtctgat ggtttgtgga ccatagaagt ctteacctct ggtttgaggt 4380 
gcagggctgt cttttgtact ggagggtgtg gggatatttt etgatagttg ccatttcttg 4440 
aaaaattccc ttgatgtacc ttacacagag cagaaataac attaacatgg atcagaggta 4500 
ctgggcttca tctgttccat tggaccttgg ctagggaata tcatttcact ggcatcaaac 4560 
ctgcttagct tatgaaaaga tggtaatatg tcatttctat aaatgtttct atatatgaaa 4620 
cataaaagtg gcagggagat acaatatcac acccctttcc cccaaaggac tgtgaaaaat 4680 
tgggggttta tggcccttgc caatteeeta gtgggttaaa agcccctatt cettaaaatt 4740 
ttaacatcgg tttcctccaa tttgggggtt ttgggggatt tgtccaactt aacctggtta 4800 
gggaaaagtt taacatggtc ccctcacccc cccctgtttg gagaaaagcc cctgtgcctc 4860 
ccccaaaga 4869 

<210> 36 
<211> 4480 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> miscfeature 

<223> Incyte ID No: 55064363CB1 

<400> 36 

atgaagtggg taggggacac 
gggctctect ccagaceggg 
gggaagggcg egcaggcggg 
ggggcggcga gcgagtccee 
ccggcggagc ccgaeggggc 
gggggcgggc cgcggcgggc 
ggcgcggccg geggcccgga 
gagggcgctc acetcaectc 
ctcgacgcct tctacgacgc 
ccttccctct tctaccatct 
ttgtaccatg acaccgatgc 
aacacagcat ccagtggaaa 
tatttttgct gcgagagtga 
gacaacatcc tgggeccgct 
gacatccacg tgacetcatg 
gccagagaga aataccaagg 
atggataata ctgaggttct 
gatatccagg actatgatgc 
tgtgatttgg ccgatcagca 
aacagcacag gtgaccgtga 
gatcacccgg gccccgacat 
gattcagact gcaaagatga 
tttgaactcc agtcatccct 
ggacaacaat ttgaaacttc 
ttgggaagaa aagggagctt 
^^gcgtcagca tgctggceca 
aaactgaaac ctccagtctg 
cgcttcaaga aaaccattat 
gatataattt ttgaggcaac 



tggagtgggg ggaaaeatcc 
tgctatggtg gcggatcgea 
cacatggaga gcggcggtgg 
tcagtgcccg ccgccgcegg 
ggcggagggc gcggcaggeg 
tctgcgggca gtatacgtgc 
ggctggggcg cggcagtgcc 
cgtgcccttc ggggagctgg 
agatgttgct gtggtagaca 
tggagtccga gaaagctttg 
cgacactgct ctctctttga 
ttattatttc atcccataea 
tgcecagaga cgagectccg 
gtgcatgect ttggtggaea 
tgtttattac aaagaaacct 
tgaggaactg gcgaaggagc 
gacctcagac atcatcatta 
gatggtgaag ctggtggaaa 
taacattaaa ttccactatg 
gaaggctctg eagatcatgc 
gttctgcctg tgtgggagga 
caccagccgc gacagcgcca 
ctattcggga attaatcttg 
cttggaacta aggaaaatag 
ggagaaaatg aacaattact 
tgatgtcggg aaagccgtce 
gtacctgcga tcattagttc 
tgaacactcg cccaggcaag 
aaatgaagtc actaatggac 



etccatcctt caetacecca 60 
gccgctggcc actcgeccag 120 
aatgctecgg ecggggcctc 180 
gggtggaggg cgcggccggg 240- 
gcageggcga gggcgagagt 300 
gcagtgagag ctcccagggc 360 
tgctgcgggc ctgcgaggce 420 
acttcgggga gacggccgtg 480 
tgagcgatgt ctccagacag 540 
acatggccaa taatgtgatc 600 
aggacatggt aactcaaaaa 660 
tcgtgacacc gtgcactgat 720 
agtacatgca geccaactgg 780 
ggtteattag cctcettaag 840 
tgttaaatga catceggaaa 900 
tageteggat caagctccgc 960 
acttactcct gtcctaccgt 1020 
cactggagat gctgcctacg 1080 
cgtttgcact gaataggaga 1140 
tccaggttct gcagagctgt 1200 
tctacaagga catcttcttg 1260 
ttgagtggta tcgcaaaggg 1320 
cagttttget gattgttgct 1380 
gtgtecggct gaacagtttg 1440 
gggatgtggg tcagttcttc 1500 
aggcagcaga gaggttgttc 1560 
agaacttgtt actaattcgg 1620 
agcggctgaa cttctggtta 1680 
tcagatttcc agttctggtc 1740 



60/67 



wo 02/46184 



PCT/USOl/47431 



atagagccaa ccaaagtgta ccagccttct tatgtttcca taaacaatga agccgaggag 1800 
agaacagttt ctttatggca tgtctcaccc acagaaatga aacagatgca cgaatggaat 1860 
tttacagcct cttccataaa gggaataagc ctatcaaagt ttgatgaaag gtgttgtttt 1920 
ctttatgtcc atgataattc tgatgacttt caaatctact tttccaccga agagcagtgc 1980 
agtagatttt tctctttggt caaagagatg ataaccaata cagcaggcag tacggtggag 2040 
ctggagggag agaccgatgg agacaccttg gagtatgagt atgaccatga tgcaaatggt 2100 
gagagagttg tcttggggaa aggcacgtat gggattgtgt atgctggccg agatctgagc 2160 
aatcaagtgc gaatagccat caaagaaatc ccggagagag atagcaggta ttctcagcct 2220 
ctgcacgagg agatagccct gcacaagtac cttaagcacc gcaatatcgt tcagtacctg 2280 
ggctctgttt cagagaacgg ctacattaag atatttatgg agcaggtgcc tggaggaagc 2340 
ctttctgctc ttctgcgatc caaatggggg ccgatgaagg aaccgacaat caagttttac 2400 
accaaacaga tcctggaggg ccttaagtat cttcatgaaa accagatcgt gcacagagac 2460 
ataaagggcg ataatgttct ggtgaacacc tacagcggag tggtgaaaat ctccgatttt 2520 
ggaacctcga aacgtcttgc gggtgtgaac ccctgcacag agacttttac tggcaccctg 2580 
cagtacatgg cacctgagat aattgaccaa gggcctcgcg gatatggtgc cccagccgat 2640 
atctggtccc tgggctgcac catcattgag atggccacca gcaagcctcc gttccatgag 2700 
cttggtgagc cgcaggcagc catgttcaaa gtgggcatgt ttaagatcca ccctgagatt 2760 
ccagaagccc tttcagctga agcccgagcc ttcattttat cctgtttcga gcctgacccc 2820 
cacaaacgtg ccaccactgc tgagctactg agagagggtt tcttaaggca ggtgaacaag 2880 
ggcaagaaga accgaattgc cttcaagccc tcagaaggtc cccgcggtgt cgtcctggcc 2940 
ctgcccacac agggagagcc catggccacc agcagcagcg agcacggctc tgtctcccca 3000 
gactccgacg cccagcctga cgcactcttt gagaggaccc gggcgcccag gcaccacctt 3060 
ggccacctcc tcagtgttcc agacgagagc tcagccttgg aagaccgggg cttggcctcg 3120 
tccccggagg acagggacca gggcctcttc ctgctacgca aggacagtga gcgccgtgcc 3180 
atcctgtaca aaatcctctg ggaggagcag aaccaggtgg cttccaacct gcaggagtgt 3240 
gtggcccaga gttccgaaga gttgcatctc tcagttggac acatcaagca aatcattggg 3300 
atcctgaggg acttcatccg ctccccagag caccgggtga tggcgaccac aatatcaaag 3360 
ctcaaggtgg acctggactt tgacagctcg tccatcagtc agattcacct ggtgctgttc 3420 
ggatttcagg atgccgtaaa taaaattttg aggaaccact taattaggcc ccactggatg 3480 
ttcgcgatgg acaacatcat ccgccgagcg gtgcaggccg cggtcaccat tctcatccca 3540 
gagctccgag cccactttga gcctacctgt gagactgaag gggtagataa ggacatggat 3600 
gaagcggaag agggctatcc cccagccacc ggacctggcc aggaggccca gccccaccag 3660 
cagcacctga gcctccagct gggtgagctc agacaggaga ccaacagact tttggaacac 3720 
ctagttgaaa aagagagaga gtaccagaat cttctgcggc aaactctaga acagaaaact 3780 
caagaattgt atcaccttca gttaaaatta aaatcgaatt gtattacaga gaacccagca 3840 
ggcccctacg ggcagagaac agataaagag cttatagact ggttgcggct gcaaggagct 3900 
gatgcaaaga caattgaaaa gattgttgaa gagggttata cactttcgga tattcttaat 3960 
gagatcacta aggaagatct aagatacctt cgactacggg gtggtctcct ctgcagactc 4020 
tggagtgcgg tctcccagta cagaagggct caggaggcct cagaaaccaa agacaaggct 4080 
tgataccaat cagctaagct gtggcagagt gtcccaccac gctacatgtt ttgttaaagc 4140 
ttctgttagt gtatacacga attccgctgt gtttacatat ttaaaaatgc cattgttcaa 4200 
ttaatagttt aagaacttgt tttaaatact gtcctgagtt tcttttgaaa cctgttattt 4260 
ataaacatag aactgtgtgt attgrtgaaaa cagtgagcct tggttttgac ctcccggaat 4320 
attaggaaat tcacttgtag tcccagctat gcaggaggct gaggtgggag gattgcttga 4380 
gcccaggagg tgtggaggct gcagtgagcc atgatcacac cactgcactc cagcctgggc 4440 
aacagagccc gacctgtctc aaaaaaaagt acacccttca 4480 

<210> 37 

<211> 4415 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7482044CB1 

<400> 37 

cgagacgtcc ccggcacgct gatggagccc gggcgcggcg cggggcccgc gggcatggcg 60 
gagcctcggg cgaaggcggc gcggccgggg ccccagcgct ttctgcggcg cagcgtggta 120 
gagtcggacc aggaggagcc gccgggcttg gaggcagccg aggcgccggg cccgcagccc 180 
ccgcagcccc tgcagcgccg ggtgcttctg ctctgcaaga cgcgccgcct catcgcggag 240 
cgcgcccgcg gacgccccgc cgcccccgcg cccgcagcgc tggtagcgca gccgggagcc 300 
cccggagccc ccgcggacgc cggccccgag cccgtgggca cgcaggagcc cggcccggac 360 
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cccatcgcag ccgctgtcga aaccgcgcct 
gcggcgaccg tgaggaagga ggatgagggg 
actcgccggg acgagcccga agaggaggag 
acctctctgg acggccgctt cctcaagttc 
acggtctaca aggggctgga cacggagacc 
gaccggaagc tcaccaagct ggagcggcag 
ggcctgcagc accccaacat cgtgcgcttc 
aagcggtgca ttgtgctggt gacggagctg 
aagcggttca aggtgatgaa gcccaaggtt 
ggcctgctgt tcctgcacac aaggacgcca 
aatattttca tcaccggacc aactgggtct 
ctgaaaagag cgtcatttgc caaaagtgtg 
atgtacgagg agcactacga tgagtccgtg 
gagatggcca cctcggagta cccctactcg 
aaggtcacct gtggtatcaa gccggccagc 
gagattattg gggagtgtat ctgcaaaaac 
ctgagccacg ccttcttcgc agaggacaca 
cacggcagga agtccaccat cgccctgagg 
ggaaagccca aggacaatgg agccatagag 
gatgaggtgg cccaagagat gattgagtct 
gtggccaagt ccatccgtga ccgcgtggcc 
cccgcgctgc agcccaagga gcagcaggat 
ccggtgcccc tgcaggtcca ggtgacctac 
gagcccgagg agccggaggc cgaccagcac 
acctccctgg cctcggacag caccttcgac 
tcgcagagca gccagcagag cgtgatgctt 
gcccagtgtg tgtgcagccc ccctgtgagc 
tcgctggggg cctaccagca gcccaoggct 
gcccccgcct gccctccgtc cctccagcag 
cccgtgctgc cgccgcccag cacccccatg 
ggccagcagc ctcctccgct ggcccagccg 
cccgtggtcc ccctccagcc ggttcccccc 
caggtggggg cccccgctca gctgaagccc 
cttgctcaag tccctccgca gatgcccccg 
gcgggaatcg acggcctccc tccggccctc 
atgccaccac ctcagtattt ctctccagcc 
ccccctgcgt ccccagcctt gcctctgcag 
cccctggcca tgccctgccg gaccattgtg 
gccgtagccc caccgggcgt ggctgccctg 
ggccaacctg tgtacccagc ggccttccca 
ccccatcaca cggtgcagaa tat.gagggcc 
cccacactgc ccccacaacc cgtgctgccc 
ttgcccccgc aacccacacg gccccctcaa 
ccaceiacctg tgctgccccc gcagccggca 
caccttcctg aacaagctgc tccagctgct 
ccagctccct atgctgtgga cgtcgccgct 
gcggtcctct cgccgcctct gccggaagtg 
cagttcccca gctccctggc cacggtgtct 
gccacacttc tgccaccagc aaacccaccg 
tgcccaactg tccagctgac ggtggaacca 
ccgcccggcc tcccgcagag ctgtgagagc 
gagctgagtg acagctgtga aggcgccttt 
cgaaaacacc accgcaggtc cacgcgtgcg 
cggcttacca tcttgaacgt gtgcaacact 
acgcacaacc acaagatggt gaccttcaag 
attgccacgt atatggtgga gcatgacttt 
gagcagatga aggatgtcat ggacaaggca 
gaccgtggct ccgacccagg gaccagcccg 
ggsrgaggaga gccgacaatc ccaagccaac 
accgggaaga ggtggttcat catctgtccg 
gaatcttcgc ccccacttcc tctaagctcc 
tgcgcctctg tgctcgcctg ccccctctct 
ctcctttgct ttcctcggga tgccactctg 



gcccccgacg gcggccccag ggaggaggcg 420 
gcggccgagg cgaagcctga gcccgggcgc 480 
gacgacgagg acgacctcaa ggccgtggcc 540 
gacatcgagc tgggccgcgg ttccttcaag 600 
tgggtggagg tggcctggtg tgagctgcag 660 
cggttcaagg aagaggctga gatgctgaaa 720 
tacgacttct gggagtccag cgccaagggc 780 
atgacctcag ggacgctgaa gacatacctg 840 
ctccgcagct ggtgccggca gatcctgaag 900 
cccatcatcc accgagacct gaaatgtgac 960 
gtgaagattg gcgacttggg cctggccact 1020 
ataggtactc ccgagttcat ggcgcccgag 1080 
gacgtctatg cctttgggat gtgcatgctg 1140 
gagtgccaga atgcggccca gatctaccgc 1200 
tttgagaaag tgcacgatcc tgaaatcaag 1260 
aaggaggaaa ggtacgagat caeiagacctg 1320 
ggcgtgaggg tggagctcgc ggaggaggac 1380 
ctctgggtgg aagaccccaa gaaactgaag 1440 
ttcaccttcg acctggagaa ggagacgccg 1500 
ggattcttcc acgagagtga cgtcaagatc 1560 
ttgatccagt ggcggcggga gaggatctgg 1620 
gtgggcagcc cggacaaggc caggggtccg 1680 
catgcacagg ctgggcagcc cgggccacca 1740 
ctcctgccac ctacgttgcc gaccagcgcc 1800 
agcggccagg gctctaccgt gtactcagac 1860 
ggctcccttg ccgacgcagc gccgtccccg 1920 
gaggggcccg tcctgccgca gagcctgccc 1980 
gcacctggct tgccggtggg ctctgtcccg 2040 
cacttcccgg atccggccat gagcttcgcc 2100 
cccacgggcc caggccagcc agcacccccc 2160 
acacccctgc cgcaggtcct ggccccacag 2220 
cacctgccac cgtacctggc tccagcctcc 2280 
ctccagatgc cacaggcgcc cctgcagccg 2340 
attcctgttg tgccccccat cacgcccctg 2400 
ccagacctgc cgaccgcgac tgtgcctccc 2460 
gtgatcttgc cgagcctcgc tgccccactc 2520 
gctgtgaagc tgccccaccc ccctggggcg 2580 
ccaaatgcac cggccactat ccccctgctg 2640 
tccattcatt ctgccgtggc ccagctccca 2700 
cagatggcgc ctactgacgt ccctccttcc 2760 
acccctccac agccggcact gcctccacaa 2820 
ccgcaaccca cgctgccccc tcaacctgtg 2880 
cctgtgctgc ccccgcaacc catgctgccc 2940 
ctgcctgtgc gccctgagcc cctccagccc 3000 
acaccaggga gccagattct gcttggccac 3060 
caggtcccca ccgtgcctgt gccaccggct 3120 
ctgctgcctg ccgcccctga gctcctgcct 3180 
gcctctgtgc agagtgtgcc cacccagact 3240 
ctgcctggcg ggcccgggat cgccagccct 3300 
gtccaagagg agcaggcctc acaggacaag 3360 
tatggaggtt ctgatgtcac ttctggaaaa . 3420 
ggagggggca ggctggaggg cagggcagcc 3480 
cgctcccggc aggagagggc cagccggccc 3540 
ggggacaaga tggtggagtg ccagctggag 3600 
ttcgacttgg acggggacgc acccgatgaa 3660 
atcctgcagg ccgagcggga aacgttcatc 3720 
gaggacatgc tcagcgagga cacagacgcc 3780 
ccacacctca gcacctgcgg cctgggcacc 3840 
gcccccgtgt atcagcagaa cgtcctgcac 3900 
gtggctgagc accccgcccc cgaggcccct 3960 
ctgccctgcc ctgccctgtt ccgcatgagc 4020 
gcttgttagt tgctctttct ggctctgcct 4080 
tgcccaggag ggtgcctgat ttcggggagt 4140 
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cctgacccga gcctgttgtc agagttggga ggggctctga gcagtgttgg gcaggccggg 4200 
tctcccatcc cgaggccagc gttcctgtgc agagccccat ccactggttc ttgccctgag 4260 
ccacatatgt ctgtgccatg ggctgagtgc cacgacaggc ccgtgtgaca gctgctgccc 4320 
acgcatgtgg aagctaggtg ggactcattc ctaattctgc cgttgtaatg agacttgatt 4380 
aaaacaccgc cacttttttg caaaaaaaaa aaaaa 4415 

<210> 38 

<211> 6306 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7476595CB1 

<400> 38 

agacacagaa acagatgaca gcagaaacac cagaaacaga tgacagcaga aacaccagaa 60 
acagatgaca gcagaaacac cagaaacaga tgacagcaga aacaccagaa acagatgaca 120 
gcagaaacac cagaaacaga tgacagcaga aacaccagaa acagatgaca gcagaaacac 180 
cagaaacaga tgacagcaga aacaccagaa acagatgaat cagtgagtag ctctaatgcc 240 
tccctgaaac ttcgaaggaa acctcgggaa agtgattttg aaacgattaa attgattagc 300 
aatggagcct atggggcagt ctactttgtt cggcataaag aatcccggca gaggtttgcc 360 
atgaagaaga ttaataaaca gaacctcatc cttcgaaacc agatccagca ggcctttgtg 420 
gagcgggata tcctgacttt tgcagaaaac ccctttgttg tcagcatgta ttgctccttt 480 
gaaacaaggc gccacttgtg catggtcatg gaatatgtgg aagggggaga ctgtgctact 540 
ttaatgaaaa acatgggtcc tctccctgtt gatatggcca gaatgtactt tgctgagacg 600 
gtcttggcct tggaatattt acataattat ggaattgtac acagggattt gaeiaccagac 660 
aacttgttgg ttacctccat ggggcacata aagctgacag attttggatt atctaaggtg 720 
ggactaatga gcatgactac caacctttac gagggtcata ttgagaagga tgctagagag 780 
ttcctggata aacaggtctg tggcacacct gaatacattg caccagaagt gattctgagg 840 
cagggttatg gaaagccggt ggactgcrtgg gccatgggga ttatcctcta tgaatttctg 900 
gttggatgcg tgccattctt tggggatact ccagaggagc tatttggaca agtcatcagt 960 
gatgagatca actggcctga gaaggatgag gcacccccac ctgatgccca ggatctgatt 1020 
accttactcc tcaggcagaa tcccctggag aggctgggaa caggtggtgc atatgaagtc 1080 
aaacagcatc gattcttccg ttctttagac tggaacagtt tgctgagaca gaaggcagaa 1140 
tttattcccc aactggaatc tgaggatgac acaagttatt ttgatactcg gtctgagaag 1200 
tatcatcata tggaaacgga ggaagaagat gacacaaatg atgaagactt taatgtggaa 1260 
ataaggcagt tttcttcatg ttcacacagg ttttcaaaac tttttctaaa tgattaccta 1320 
gatgcacctg caaatgggcc agcactaccc tcctgtgtat gggaatggca tcgaggtaag 1380 
gatttccctg gagaaggtgg tagccagtct gtcctagagc caggacagaa gcttgctaag 1440 
tgtggactca gaccaggact gttctctggg ccatcaaaga caacaatgcc aacccctaaa 1500 
cactgcttcc ttctttgcct tgatactgaa agcaacagac ataaactcag ttctggccta 1560 
cttcccaaac tggctatttc aacagaggga gagcaagatg aagctgcctc ctgccctgga 1620 
gacccccatg aggagccagg aaagccagcc cttcctcctg aagagtgtgc ccaggaggag 1680 
cctgaggtca ccaccccagc cagcaccatc agcagctcca ccctgtcaga tatgtttgct 1740 
gtttcccctc tgggaagtcc aatgtctccc. cattccctgt cctcggaccc ttcttcttca 1800 
cgagattcct ctcccagccg agattcctca gcagcttctg ccagtccaca tcagccgatt 1860 
gtgatccaca gttcggggaa gaactacggc tttaccatcc gagccatccg ggtgtatgtg 1920 
ggagacagtg acatctatac agtgcaccat atcgtctgga atgtagaaga aggaagtccg 1980 
gcatgccagg caggactgaa ggctggagat cttatcactc acatcaatgg agaaccagtg 2040 
catggacttg tccacacaga agttatagaa ctcctactga agagtgggaa taaggtgtca 2100 
atcactacta ccccatttga aaacacatca atcaaaactg gaccagccag gagaaacagc 2160 
tataagagcc ggatggtgag gcggagcaag aaatccaaga agaaagaaag tctcgasiagg 2220 
aggagatctc ttttcaaaeia gctagcceiag cagccttctc ctttactcca caccagccga 2280 
agtttctcct gcttgaacag atccctgtca tcgggtgaga gcctcccagg ttcccccact 2340 
catagcttgt ctccccggtc tcceiacacca agctaccgct ccacccctga cttcccatct 2400 
ggtactaatt cctcccagag cagctcccct agttctagtg cccccaattc cccagcaggg 2460 
tccgggcaca tccggcccag cactctccac ggtcttgcac ccaaactcgg cgggcagcgg 2520 
taccggtccg gaaggcgaaa gtccgccggc aacatcccac tgtccccgct ggcccggacg 2580 
ccctctccaa ccccgcaacc cacctccccg cagcggtcac catcccctct tctgggacac 2640 
tcactgggca attccaagat cgcgcaagcc tttcccagca agatgcactc cccgcccacc 2700 
atcgtcagac acatcgtgag gcccaagagt gcggagcccc ccaggtcccc gctgctcaag 2760 
cgcgtgcagt ccgaggagaa gctgtcgccc tcttacggca gtgacaagaa gcacctgtgc 2820 
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tcccgcaagc acagcctgga ggtgacccaa gaggagcrtgc agcgggagca gtcccagcgg 2880 
gaggcgccgc tgcagagcct ggatgagaac gtgtgcgacg tgccgccgct cagccgcgcc 2940 
cggccagtgg agcaaggctg cctgaaacgc ccagtctccc ggaaggtggg ccgccaggag 3000 
tctgtggacg acctggaccg cgacaagctg aaggccaagg tggtggtgaa gaaagcagac 3060 
ggcttcccag agaaacagga atcccaccag aaatcccatg gacccgggag tgatttggaa 3120 
aactttgctc tgtttaagct ggaagagaga gagaagaaag tctatccgaa ggctgtggaa 3180 
aggtcaagta cttttgaaaa caaagcgtct atgcaggagg cgccaccgct gggcagcctg 3240 
ctgaaggatg ctcttcacaa gcaggccagc gtgcgcgcca gcgagggtgc gatgtcggat 3300 
ggccgggtgc ctgcggagca ccgccagggt ggcggggact tcagacgggc ccccgctcct 3360 
ggcaccctcc aggatggtct ctgccactcc ctcgacaggg gcatctctgg gaagggggaa 3420 
ggcacggaga agtcctccca ggccaaggag cttctccgat gtgaaaagtt agacagcaag 3480 
ctggccaaca tcgattacct ccgaaagaaa atgtcacttg aggacaaaga ggacaacctc 3540 
tgccctgtgc tgaagcccaa gatgacagct ggctcccacg aatgcctgcc agggaaccca 3600 
gtccgaccca cgggtgggca gcaggagccc ccgccggctt ctgagagccg agcttttgtc 3660 
agcagcaccc atgcagctca gatgagtgcc gtctcttttg ttcccctcaa ggccttaaca 3720 
ggccgggtgg acagtggaac ggagaagcct ggcttggttg ctcctgagtc ccctgttagg 3780 
aagagcccct ccgagtataa gctggaaggt aggtctgtct catgcctgaa gccgatcgag 3840 
ggcactctgg acattgctct cctgtccgga cctcaggcct ccaagacaga actgccttcc 3900 
ccagagtctg cacagagccc cagcccaagt ggtgacgtga gggcctctgt gccaccagtt 3960 
ctccccagca gcagtgggaa aaagaacgat accaccagtg caagagagct ttctccttcc 4020 
agcttaaaga tgaataaatc ctacctgctg gagccttggt tcctgccccc cagccgaggt 4080 
ctccagaatt caccagcagt ttccctgcct gacccagagt tcaagaggga caggaaaggt 4140 
ccccatccta ctgccaggag ccctggaaca gtcatggaaa gcaatcccca acagagagag 4200 
ggcagctccc ctaaacacca agaccacacc actgacccca agcttctgac ctgcctgggg 4260 
cagaacctcc acagccctga cctggccagg ccacgctgcc cgctcccacc tgaagcttcc 4320 
ccctcaaggg agaagccagg cctgagggaa tcgtctgaaa gaggccctcc cacagccaga 4380 
agcgagcgct ctgctgcgag ggctgacaca tgcagagagc cctccatgga actgtgcttt 4440 
ccagaaactg cgaaaaccag tgacaactcc aaaaatctcc tctctgtggg aaggacccac 4500 
ccagatttct atacacagac ccaggccatg gagaaagcat gggcgccggg tgggaaaacg 4560 
aiaccacaaag atggcccagg tgaggcgagg cccccgccca gagacaactc ctctctgcac 4620 
tcagctggaa ttccctgtga gaaggagctg ggcaaggtga ggcgtggcgt ggaacccaag 4680 
cccgaagcgc ttcttgccag gcggtctctg cagccacctg gaattgagag tgagaagagt 4740 
gaaaagctct ccagtttccc atctttgcag aaagatggtg ccaaggaacc tgaaaggaag 4800 
gagcagcctc tacaaaggca tcccagcagc atccctccgc cccctctgac ggccaaagac 4860 
ctgtccagcc cggctgccag gcagcattgc agttccccaa gccacgcttc tggcagagag 4920 
ccgggggcca agcccagcac tgcagagccc agctcgagcc cccaggaccc tcccaagcct 4980 
gttgctgcgc acagtgaaag cagcagccac aagccccggc ctggccctga cccgggccct 5040 
ccaaagacta agcaccccga ccggtccctc tcctctcaga aaccaagtgt cggggccaca 5100 
aagggcaaag agcctgccac tcagtccctc ggtggctcta gcagagaggg gaagggccac 5160 
acrtaagagtg ggccggatgt gtttcctgct accccaggct cccagaacaa agccagcgat 5220 
gggattggcc agggagaagg tgggccctct gtcccactgc acactgacag ggctcctcta 5280 
gacgccaagc cacaacccac cagtggtggg cggcccctgg aggtgctgga gaagcctgtg 5340 
catttgccaa ggccgggaca cccagggcct agtgagccag cggaccagaa actgtccgct 5400 
gttggtgaaa agcaaaccct gtctccaeiag caccccaaac catccacfcgt gaaagattgc 5460 
cccaccctgt gcaaacagac agacaacaga cagacagaca aaagcccgag tcagccggcc 5520 
gccaacaccg acagaagggc ggaagggaag aaatgcactg aagcacttta tgctccagca 5580 
gagggcgaca agctcgaggc cggcctttcc tttgtgcata gcgagaaccg gttgaaaggc 5640 
gcggagcggc cagccgcggg ggtggggaag ggcttccctg aggccagagg gaaagggccc 5700 
ggtccccaga agccaccgac ggaggcagac aagcccaatg gcatgaaacg gtccccctca 5760 
gccactgggc agagttcttt ccgatccacg gccctcccgg aaaagtctct gagctgctcc 5820 
tccagcttcc ctgaaaccag ggccggagtt agagaggcct ctgcagccag cagcgacacc 5880 
tcttctgcca aggccgccgg gggcatgctg gagcttccag cccccagcaa cagggaccat 5940 
aggaaggctc agcctgccgg ggagggccga acccacatga caaagagtga ctccctgccc 6000 
tccttccggg tctccaccct gcctctggag tcacaccacc ccgacccaaa caccatgggc 6060 
ggggccagcc accgggacag ggctctctcg gtgactgcca ccgtagggga aaccaaaggg 6120 
aaggaccctg ccccagccca gcctccccca gctaggaaac agaacgtggg cagagacgtg 6180 
accaagccat ccccagcccc aaacactgac cgccccatct ctctttctaa tgagaaggac 6240 
tttgtggtac ggcagaggcg ggggaaagag agtttgcgta gcagccctca caaaaaggcc 6300 
ttgtaa g306 

<210> 39 
<211> 7151 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_featiire 

<223> Incyte ID No: 71824382CB1 

<400> 39 

aatatacgac ttaattgtat tcttttaaaa 
accctcaaaa tagatgtata tgggtgaaat 
gtgtgttgga tgtttaattc agcaccagca 
tttattttta aaaccatacc ttttaaaatt 
ataatttaaa ggaaaaccag cagaaatcga 
gcagttggag cagtttattt tggacgggcc 
ggagacatta ctggatatac tcatctgcct 
aagagagaag aacattctcg aatacctaga 
acaaatgcga ttacatagag aagactttga 
tggggagcftt gctgtagtaa aactaaaaaa 
gaataaatgg gaaatgctga aaagagctga 
attagtgaat ggagacaata aatggattac 
taacttatac ctggttatgg attattatgt 
atttgaagat agattgcctg aagatatggc 
aattgactca gttcatcagc tacattatgt 
gatggatatg aatggacata ttcggttagc 
agatggaacg gttcagtcct cagtggctgt 
ccttcaagcc atggaagatg gaaaagggag 
gggggtctgt atgtatgaaa tgctttacgg 
ggagacatac ggaaaaatca tgaaccacaa 
tgatgtgtct gaaaatgcta aggatcttat 
acttggtcaa aatggaatag aagactttaa 
ggataatatt cggctactgtg aagcacctta 
atcgaatttt gatgtagatg atgattgttt 
acatactgca ttttctggcc accatctgcc 
tgtactttct gatcggagct gtttaagagt 
tgttaatgtt cagaggactc tagacaacaa 
taagcgcctt gagcaagaaa aacttgaact 
tgtccaagct ctgcagtatt caactgttga 
aataaaaaac ttaaaagaag aaattgaaaa 
tttggaacag caacttgaag aagctaatgc 
acaaatcaag gcttatgaaa aacaaatcaa 
taaggaacta gtccaggcta gtgagcgatt 
acactgtcag aggaaactgg ccatgcagga 
attgcacacc caaaaacaga aacttgctcg 
cctggtgatg caaaaagttg aaagcttaag 
aaaagagctg gaagttcata cagaagctct 
acgtgaacag agtgagcact attctaagca 
aaaacaaatt agttactcac caggagtatg 
actaaagact gatttggaaa agaaaagtat 
aggaatacat gcaaatgaaa taaaaaatct 
gcaacttgct ctcaacaaag aaattatgat 
agaaagtcaa agtgaaaggg aggaatttga 
aaaagtgttg ttaactgaag aaaataaaaa 
tttgtatgag aacttaagta tacacaacca 
agacciagaaa gaatcagttg cacattggga 
cagcgatgaa aaggatgcac gagggtatct 
attggaggca ttaagaaatt ccagcttggg 
gcgtcgtttt gcgaaactgg atatgtcagc 
agaaataaga gccaaacagg ccatccaaga 
cataacagaa tgtaaactaa aagattcaga 
cgaacagctg ataaaggaca ctgaagagct 
ctcacagcat tctttcttgg catttttgaa 
agattccttt tcttcttcct catcttcact 
tgagaacaca tatgtatgga acccgagcgt 
tccttccaca tctagtgaag ctgagccagt 



atgcattaag tatatatttt atggtaattt 60 
tgaagacgct tcagttaagt gaggttactg 120 
ttgcatgaca gttgtttgaa taacaagtgg 180 
taggttcaga taatagtaaa agtcatcata 240 
agcaaacatg tctggagaag tgcgtttgag 300 
cgctcagacc aatgggcagt gcttcagtgt 360 
ttatgatgaa tgcaataatt ctccattgag 420 
atgggctaaa ccatttactt ctaaagtgaa 480 
aatattaaag gtgattggtc gaggagcttt 540 
tgcagataaa gtgtttgcca tgaaaatatt 600 
gacagcatgt tttcgtgaag aaagggatgt 660 
aaccttgcac tatgctttcc aggatgacaa 720 
tggtggggat ttgcttactc tactcagcaa 780 
tagattttac ttggctgaga tggtgatagc 840 
acacagagac attaaacctg acaatatact 900 
agattttggt tcttgtctga agctgatgga 960 
aggaactcca gattatatct ctcctgaaat 1020 
atatggacct gaatgtgact ggtggtcttt 1080 
agaaacacca ttttatgcag aatcgctggt 1140 
agagaggttt cagtttccag cccaagtgac 1200 
tcgaaggctc atttgtagca gagaacatcg 1260 
gaaacaccca tttttcagtg gaattgattg 1320 
tattccagaa gttagtagcc caacagatac 1380 
aaaaaattct gaaacgatgc ccccaccaac 1440 
atttgttggt tttacatata ctagtagctg 1500 
tacggctggt cccacctcac tggatcttga 1560 
cttagcaact gaagcttatg aaagaagaat 1620 
cagtagaaaa cttcaagagt caacacagac 1680 
tggtccacta acagcaagca aagatttaga 1740 
actaagaaaa caagtaacag aatcaagtca 1800 
tgtgaggcaa gaactagatg atgcttttag 1860 
aacgttacaa caagaaagag aagatctaaa 1920 
siaaaaaccaa tccaaagagc tgaaagacgc 1980 
attcatggag atcaatgagc ggctaacaga 2040 
ccatgtccga gataaggaag aagaggtgga 2100 
gcaagaactg cgcagaacag aaagagccaa 2160 
agctgctgaa gcatctaaag acaggaagct 2220 
actggaaaat gaattggagg gactgaagca 2280 
cagcatagaa catcagcaag agataaccaa 2340 
cttttatgaa graagaattat ctaaaagaga 2400 
taagaaagaa ctgcatgatt cagaaggtca 2460 
tttaaaagac aaattggaaa aaaccagaag 2520 
aagtgagttc aaacaacaat atgaacgaga 2580 
gctgacgagt gaacttgata agcttactac 2640 
gcagttagaa gaagaggtta aagatctagc 2700 
agcccaaatc acagaaataa ttcagtgggt 2760 
tcaggcctta gcttctaaaa tgactgaaga 2820 
tacacgagca acagatatgc cctggaaaat 2880 
tagactggag ttgcagtcgg ctctggatgc 2940 
agagttgaat aaagttaaag catctaatat 3000 
gaagaagaac ttggaactac tctcagaaat 3060 
tagatctgaa aagggtatag agcaccaaga 3120 
tacgcctacc gatgctctgg atcaatttga 3180 
gattgatttt ttggatgaca ctgatcccgt 3240 
caagtttcac atccagtcac ggtccacatc 3300 
taagactgta gactccactc cactttcagt 3360 



65/67 



wo 02/46384 



PCTAJSOl/47431 



tcacacacca accttaagga aaaaaggatg tcctggttca actggctttc cacctaagcg 3420 
caagactcac cagttttttg taaaatcttt tactactcct Stccaiagtgtc atcagtgtac 3480 
ctccttgatg gtgggtttaa taagacaggg ctgttcatgt gaagtgtgtg gattctcatg 3540 
ccatataact tgtgtaaaca aagctccaac cacttgtcca gttcctcctg aacagacaaa 3600 
aggtcccctg ggtatagatc ctcagaaagg aataggaaca gcatatgaag gtcatgtcag 3660 
gattcctaag ccagctggag tgaagaaagg gtggcagaga gcactggcta tagtgtgtga 3720 
cttcaaactc tttctgtacg atattgctga aggaaaagca tctcagccca gtgttgtcat 3780 
tagtcaagtg attgacatga gggatgaaga attttctgtg agttcagtct tggcttctga 3840 
tgttatccat gcaagtcgga aagatatacc ctgtatattt agggtcacag cttcccagct 3900 
ctcagcatct aataacaaat gttcaatcct gatgctagca gacactgaga atgagaagaa 3960 
taagtgggtg ggagtgctga gtgaattgca caagattttg aagaaaaaca aabtcagaga 4020 
ccgctcagtc tatgttccca aagaggctta tgacagcact ctacccctca ttaaaacaac 4080 
ccaggcagcc gcaatcatag atcatgaaag aattgctttg ggaaacgaag aagggttatt 4140 
tgttgtacat gtcaccaaag atgaaattat tagagttggt gacaataaga agattcatca 4200 
gattgaactc attccaaatg atcagcttgt tgctgtgatc tcaggacgaa atcgtcatgt 4260 
acgacttttt cctatgtcag cattgg^tgg gcgagagacc gatttttaca agctgtcaga 4320 
aactaaaggg tgtcaaaccg taacttctgg aaaggtgcgc catggagctc tcacatgcct 4380 
gtgtgtggct atgaaaaggc aggtcctctg ttatgaacta tttcagagca agacccgtca 4440 
cagaaaattt aaagaaattc aagtcccata taatgtccag tggatggcaa tcttcagtga 4500 
acaactctgt gtgggattcc agtcaggatt tctaagatac cccttgaatg gagaaggaaa 4560 
tccatacagt atgctccatt caaatgacca tacactatca tttattgcac atcaaccaat 4620 
ggatgctatc tgcgcagttg agatctccag taaagaatat ctgctgtgtt ttaacagcat 4680 
tgggatatac actgactgcc agggccgaag atctagacaa caggaattga tgtggccagc 4740 
aaatccttcc tcttgttgtt acaatgcacc atatctctcg gtgtacagtg aaaatgcagt 4800 
tgatatcttt gatgtgaact ccatggaatg gattcagact cttcctctca aaaaggttcg 4860 
acccttaaac aatgaaggat cattaaatct tttagggttg gagaccatta gattaatata 4920 
tttcaaaaat aagatggcag aaggggacga actggtagta cctgaaacat cagataatag 4980 
tcggaaacaa atggttagaa acattaacaa taagcggcgt tattccttca gagtcccaga 5040 
agaggaaagg atgcagcaga ggagggaaat gctacgagat ccagaaatga gaaataaatt 5100 
aatttctaat ccaactaatt ttaatcacat agcacacatg ggtcctggag atggaataca 5160 
gatcctgaaa gatctgccca tgaaccctcg gcctcaggaa agtcggacag tattcagtgg 5220 
ctcagtcagt attccatcta tcaccaaatc ccgccctgag ccaggccgct ccatgagtgc 5280 . 
tagcagtggc ttgtcagcaa ggtcatccgc acagaatggc agcgcattaa agagggaatt 5340 
ctctggagga agctacagtg ccaagcggca gcccatgccc tccccgtcag agggctcttt 5400 
gtcctccgga ggcatggacc aaggaagtga tgccccagcg agggactttg acggagagga 5460 
ctctgactct ccgaggcatt ccacagcttc caacagttcc aacctaagca gccccccaag 5520 
cccagtttca ccccgaaaaa ccaagagcct ctccctggag agcactgacc gcgggagctg 5580 
ggacccgtga gctgcctcag cactgggacc tctcgctctc cgctccctgc cactcgcctc 5640 
ctctcacttt catctcttcc ctccacctcg cctgctcggc ctgaaagcca ccaggggctg 5700 
gcagcagtag caggacaggg cttcaggagt tctgacgaca cgactctcag atccacgccc 5760 
ccagcctaac agcaacaaca aagacagact ttccgtagca gcttagatta acgttgattt 5820 
cattccatgc acttagagtt gctttcagta acattttacc cctactccca aaggtagctt 5880 
aaatagacag attacacaaa tgtaagtgat aagaataaga ttagacagat tttgctttca 5940 
cagtagagtc tcattatagt cctaaaatag ctcatgggct tctccgcatc cagaagggag 6000 
aattggtccc tggagrtggct cactaagctc ttaatcagca aacgcagtga gtatcaacct 6060 
gattgttgcc aggaaatcct tatgaattaa aacaatgcat attttactac agtacagagt 6120 
ttaaatgaat acataaatgt agaagtactg aatgtatata tttaaaagga gcctcttgta 6180 
ttcaacaaaa gatggatgca tatataagag agatgattta atttaaagaa atatgttgtt 6240 
tcttgtctgt aatgtaatgt aaagggtgga aaggcctcaa gctcacattt gtagagagag 6300 
agcgagagaa atcagagttc cctttattgc cctgtcctca aactggtcat aggctctagt 6360 
cacctgggga gctgtagaaa acacttgcag agccaggttt tgctggtttg gggcatgccc 6420 
tgggcaccag agctttaaca tttgaagcca cttcagcagc agcagcaaaa ggcgaactca 6480 
tctctaccca agatgtttct tttcctagtg gtggaatttg aacacttctc actttttatt 6540 
gtattttatc ttccgcagat aaatgtagaa atacacggtt ctgtcacctc tgatcccttc 6600 
catctgaaag ggtacaagga gtgttgtagc ttctgaaggt gcagaaaaca atttctaaaa 6660 
atgcttttat tcctgggcta atcctgtccc tccctaagtc gcagcgaggt gtctgtccca 6720 
gggctggaga tgcttcccaa ggaggagtct gttttgttga gagtgggcgt gggcttcttc 6780 
acataagcct ggggaaggaa gaaaaaacgg ctttcattac caaataatgt aaaacctcaa 6840 
aagcaagggc ttcaacagcc ttaaccaaat attattcccc atagccagtg gaaaatggat 6900 
gtgacaaccc cagtgcgcag gccagagtga gtgagcccag cacggcgctc cgactggctt 6960 
cctctctcag gtgctggatt gtggggttag tggcatttcc agctggattc ctcctgttgt 7020 
agttgccata aggaaatgag atgcagaatc agaaggatct atttctacag aatcatttca 7080 
ccagttaagc acatgagtag agaaagagat eiaaaataasia gtatctcatg aaggaaagaa 7140 
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aaaaaaaaaa a 7151 

<210> 40 

<211> 2378 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 3566882CB1 

<400> 40 

aggcagcagc cacagcgggg agtgcgcggc gcggggacag gaagagaggg gcaatggctg 60 
ccgaccccac cgagctgcgg ctgggcagcc tccccgtctt cacccgcgac gacttcgagg 120 
gcgactggcg cctagtggcc agcggcggct tcagccaggt gttccaggcg cggcacaggc 180 
gctggcggac ggagtacgcc atcaagtgcg ccccctgcct tccacccgac gccgccagga 240 
cctttgcagc ttctgtttcc ccactcccct ctatttacct agcgaagatt tcagacttcg 300 
gcctgtccaa gtggatggaa cagtccaccc ggatgcagta catcgagagg tcggctctgc 360 
ggggcatgct cagctacatc ccccctgaga tgttcctgga gagtaacaag gccccaggac 420 
ctaaatatga tgtgtacagc cccccgaccc tgccaccccg ggctggggtg atcttggatg 480 
ttcaactaag tcattcagaa agggttctct gcatccacag ctttgcaatt gtcatctggg 540 
agctactcac tcagaagaaa ccatactcag agctcacgtc acagctaaag gaaaggaaag 600 
ggttcaacat gatgatgatt attatccgag tgacggcagg catgcggccc tccctacagc 660 
ctgtctctga ccaatggcca agcgaggccc agcagatggt ggacctgatg aaacgctgct 720 
gggaccagga ccccaagaag aggccatgct ttctagacat taccatcgag acagacatac 780 
tgctgtcact gctgcagagt cgtgtggcag tcccagagag caaggccctg gccaggaagg 840 
tgtcctgcaa gctgtcgctg cgccagcccg gggaggttaa tgaggacatc agccaggaac 900 
tgatggacag tgactcagga aactacctga agcgggccct tcagctctcc gaccgtaaga 960 
atttggtccc gagagatgag gaactgtgta tctatgagaa caaggtcacc cccctccact 1020 
tcctggtggc ccagggcagt gtggagcagg tgaggttgct gctggcccac gaggtagacg 1080 
tggactgcca gacggcctct ggatacacgc ccctcctgat cgccgcccag gaccagcaac 1140 
ccgacctctg tgccctgctt ttggcacatg gtgctgatgc caaccgagtg gatgaggatg 1200 
gctgggcccc actgcacttt gcagcccaga atggggatga cggcactgcg cgcctgctcc 1260 
tggaccacgg ggcctgtgtg gatgcccagg aacgtgaagg gtggacccct cttcacctgg 1320 
ctgcacagaa taactttgag aatgtggcac ggcttctggt ctcccgtcag gctgacccca 1380 
acctgcatga ggctgagggc aagacccccc tccatgtggc cgcctacttt ggccatgtta 1440 
gcctggtcaa gctgctgacc agccaggggg ctgagttgga tgctcagcag agaaacctga 1500 
gaacaccact gcacctggca gtagagcggg gcaeiagtgag ggccatccaa cacctgctga 1560 
agagtggagc ggtccctgat gcccttgacc agagcggcta tggcccactg cacactgcag 1620 
ctgccagggg caaatacctg atctgcaaga tgctgctcag gtacggagcc agccttgagc 1680 
tgcccaccca ccagggctgg acacccctgc atctagcagc ctacaagggc cacctggaga 1740 
tcatccatct gctggcagag agccacgcaa acatgggtgc tcttggagct gtgaactgga 1800 
ctcccctgca cctagctgca cgccacgggg aggaggcggt ggtgtcagca ctgctgcagt 1860 
gtggggctga ccccaatgct gcagagcagt caggctggac acccctccac ctggcggtcc 1920 
agaggagcac cttcctgagt gtcatcaacc tcctagaaca tcacgcaaat gtccacgccc 1980 
gcaacaaggt gggctggaca cccgcccacc tggccgccct caagggcaac acagccatcc 2040 
tcaaagtgct ggtcgaggca ggcgcccagc tggacgtcca ggatggagtg agctgcacac 2100 
ccctgcaact ggccctccgc agccgaaagc agggcatcat gtccttccta gagggcaagg 2160 
agccgtcagt ggccactctg ggtggttcta agccaggagc cgagatggaa atttagacaa 2220 
cttggccagc cgtggtggct cacgtctgta atcccagcac tttgggaggc tgaggcaggc 2280 
agatcacctg agatcaagag tttgaggcca gcctggccata catggcaaeia ccctgtctct 2340 
gctaaaaata caaaatttag ctgggaaaaa aaaaaaaa 2378 
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